NUCLEIC ACID-ASSOCIATED PROTEINS
TECHNICAL FIELD 

The invention relates to novel nucleic acids, nucleic acid-associated proteins encoded by these 
nucleic acids, and to the use of these nucleic acids and proteins in the diagnosis, treatment, and
prevention of cell proliferative, neurological, developmental, and autoimmune/inflammatory disorders,
and infections. The invention also relates to the assessment of the effects of exogenous compounds 
on the expression of nucleic acids and nucleic acid-associated proteins. 

BACKGROUND OF THE INVENTION

Multicellular organisms are composed of diverse cell types that differ dramatically both in
structure and function. The identity of a cell is determined by its characteristic pattern of gene
expression, and different cell types express overlapping but distinctive sets of genes throughout
development. Spatial and temporal regulation of gene expression is critical for the control of cell
proliferation, cell differentiation, apoptosis, and other processes that contribute to organismal
development. Furthermore, gene expression is regulated in response to extracellular signals that 
mediate cell-cell communication and coordinate the activities of different cell types. Appropriate gene 
regulation also ensures that cells function efficiently by expressing only those genes whose functions 
are required at a given time. 

Transcription Factors

Transcriptional regulatory proteins are essential for the control of gene expression. Some of 
these proteins function as transcription factors that initiate, activate, repress, or terminate gene 
transcription. Transcription factors generally bind to the promoter, enhancer, and upstream regulatory 
regions of a gene in a sequence-specific manner, although some factors bind regulatory elements
within or downstream of a gene coding region. Transcription factors may bind to a specific region of
DNA singly or as a complex with other accessory factors. (Reviewed in Lewin, B. (1990) Genes IV,
Oxford University Press, New York, NY, and Cell Press, Cambridge, MA, pp. 554-570.) 

The double helix structure and repeated sequences of DNA create topological and chemical 
features which can be recognized by transcription factors. These features are hydrogen bond donor
and acceptor groups, hydrophobic patches, major and minor grooves, and regular, repeated stretches
of sequence which induce distinct bends in the helix. Typically, transcription factors recognize specific 
DNA sequence motifs of about 20 nucleotides in length. Multiple, adjacent transcription factor-binding 
motifs may be required for gene regulation.

Many transcription factors incorporate DNA-binding structural motifs which comprise either α
helices or β sheets that bind to the major groove of DNA. Four well-characterized structural motifs
are helix-turn-helix, zinc finger, leucine zipper, and helix-loop-helix. Proteins containing these motifs 
may act alone as monomers, or they may form homo- or heterodimers that interact with DNA. 

The helix-turn-helix motif consists of two α helices connected at a fixed angle by a short
chain of amino acids. One of the helices binds to the major groove. Helix-turn-helix motifs are
exemplified by the homeobox motif which is present in homeodomain proteins. These proteins are 
critical for specifying the anterior-posterior body axis during development and are conserved 
throughout the animal kingdom. The Antennapedia and Ultrabithorax proteins of Drosophila
melanogaster are prototypical homeodomain proteins. (Pabo, C.O. and R.T. Sauer (1992) Annu.
Rev. Biochem. 61:1053-1095.) 

The zinc finger motif, which binds zinc ions, generally contains tandem repeats of about 30 
amino acids consisting of periodically spaced cysteine and histidine residues. Examples of this 
sequence pattern, designated C2H2 and C3HC4 ("RING" finger), have been described. (Lewin,
supra.) Zinc finger proteins each contain an α helix and an antiparallel β sheet whose proximity and
conformation are maintained by the zinc ion. Contact with DNA is made by the arginine preceding
the α helix and by the second, third, and sixth residues of the α helix. Variants of the zinc finger motif
include poorly defined cysteine-rich motifs which bind zinc or other metal ions. These motifs may not
contain histidine residues and are generally nonrepetitive. The zinc finger motif may be repeated in a
tandem array within a protein, such that the α helix of each zinc finger in the protein makes contact
with the major groove of the DNA double helix. This repeated contact between the protein and the 
DNA produces a strong and specific DNA-protein interaction. The strength and specificity of the 
interaction can be regulated by the number of zinc finger motifs within the protein. Though originally 
identified in DNA-binding proteins as regions that interact directly with DNA, zinc fingers occur in a
variety of proteins that do not bind DNA (Lodish, H. et al. (1995) Molecular Cell Biology, Scientific
American Books, New York, NY, pp. 447-451). For example, Galcheva-Gargova, Z. et al. ((1996)
Science 272:1797-1802) have identified zinc finger proteins that interact with various cytokine
receptors. 

The C2H2-type zinc finger signature motif contains a 28 amino acid sequence, including 2 
conserved Cys and 2 conserved His residues in a C-2-C-12-H-3-H type motif. The motif generally
occurs in multiple tandem repeats. A cysteine-rich domain including the motif Asp-His-His-Cys
(DHHC-CRD) has been identified as a distinct subgroup of zinc finger proteins. The DHHC-CRD
region has been implicated in growth and development. One DHHC-CRD mutant shows defective
function of Ras, a small membrane-associated GTP-binding protein that regulates cell growth and
differentiation, while other DHHC-CRD proteins probably function in pathways not involving Ras
(Bartels, D.J. et al. (1999) Mol. Cell Biol. 19:6775-6787).
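
By way of illustration only, the C-2-C-12-H-3-H spacing described above can be written as a regular expression and used to scan a protein sequence. The following Python sketch is not part of the original disclosure. The function name find_c2h2_fingers and the toy sequence are hypothetical, and a match indicates only that the Cys and His residues are spaced as stated, not that the region forms a functional zinc finger.

```python
import re

# C-x(2)-C-x(12)-H-x(3)-H: Cys, any 2 residues, Cys, any 12 residues,
# His, any 3 residues, His (the spacing quoted in the text).
C2H2_PATTERN = re.compile(r"C.{2}C.{12}H.{3}H")


def find_c2h2_fingers(protein):
    """Return (start, matched_text) for each non-overlapping C2H2-like match.

    `protein` is a one-letter amino acid string; start positions are 0-based.
    """
    return [(m.start(), m.group()) for m in C2H2_PATTERN.finditer(protein)]


# Made-up toy sequence containing two tandem copies of the spacing.
finger = "C" + "AA" + "C" + "G" * 12 + "H" + "KKK" + "H"
toy_protein = "MT" + finger + "TGEKP" + finger + "LL"

for start, text in find_c2h2_fingers(toy_protein):
    print(start, text)
```

Because the motif generally occurs in tandem repeats, counting the matches returned for a sequence gives a rough, assumption-laden estimate of the number of candidate fingers.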

Zinc-finger transcription factors are often accompanied by modular sequence motifs such as
the Kruppel-associated box (KRAB) and the SCAN domain. For example, the
hypoalphalipoproteinemia susceptibility gene ZNF202 encodes a SCAN box and a KRAB domain
followed by eight C2H2 zinc-finger motifs (Honer, C. et al. (2001) Biochim. Biophys. Acta
1517:441-448). The SCAN domain is a highly conserved, leucine-rich motif of approximately 60
amino acids found at the amino-terminal end of zinc finger transcription factors. SCAN domains are
most often linked to C2H2 zinc finger motifs through their carboxyl-terminal end. Biochemical binding
studies have established the SCAN domain as a selective hetero- and homotypic oligomerization 
domain. SCAN domain-mediated protein complexes may function to modulate the biological function 
of transcription factors (Schumacher, C. et al. (2000) J. Biol. Chem. 275:17173-17179). 

The KRAB (Kruppel-associated box) domain is a conserved amino acid sequence spanning
approximately 75 amino acids and is found in almost one-third of the 300 to 700 genes encoding C2H2
zinc fingers. The KRAB domain is found N-terminally with respect to the finger repeats. The KRAB 
domain is generally encoded by two exons; the KRAB-A region or box is encoded by one exon and 
the KRAB-B region or box is encoded by a second exon. The function of the KRAB domain is the 
repression of transcription. Transcription repression is accomplished by recruitment of either the
KRAB-associated protein-1, a transcriptional corepressor, or the KRAB-A interacting protein.
Proteins containing the KRAB domain are likely to play a regulatory role during development 
(Williams, A.J. et al. (1999) Mol. Cell Biol. 19:8526-8535). A subgroup of highly related human 
KRAB zinc finger proteins detectable in all human tissues is highly expressed in human T lymphoid 
cells (Bellefroid, E.J. et al. (1993) EMBO J. 12:1363-1374). The ZNF85 KRAB zinc finger gene, a
member of the human ZNF91 family, is highly expressed in normal adult testis, in seminomas, and in
the NT2/D1 teratocarcinoma cell line (Poncelet, D.A. et al. (1998) DNA Cell Biol. 17:931-943).

The C4 motif is found in hormone-regulated proteins. The C4 motif generally includes only 2 
repeats. A number of eukaryotic and viral proteins contain a conserved cysteine-rich domain of 40 
to 60 residues (called C3HC4 zinc-finger or RING finger) that binds two atoms of zinc, and is
probably involved in mediating protein-protein interactions. The 3D "cross-brace" structure of the zinc
ligation system is unique to the RING domain. The spacing of the cysteines in such a domain is
C-x(2)-C-x(9 to 39)-C-x(1 to 3)-H-x(2 to 3)-C-x(2)-C-x(4 to 48)-C-x(2)-C. The PHD finger is a
C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated
transcriptional regulation.
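
As a hedged illustration, the RING finger spacing given above can be converted mechanically into a regular expression. The Python sketch below is not part of the original disclosure. It assumes that each range such as "x(9 to 39)" is written in the shorthand "x(9,39)", and the names RING_SPACING and spacing_to_regex, as well as the toy sequence, are hypothetical.

```python
import re

# The spacing from the text, with "x(a to b)" written as "x(a,b)" (an assumed shorthand).
RING_SPACING = "C-x(2)-C-x(9,39)-C-x(1,3)-H-x(2,3)-C-x(2)-C-x(4,48)-C-x(2)-C"


def spacing_to_regex(pattern):
    """Translate a dash-separated spacing pattern into a regular expression string.

    "x(n)" or "x(a,b)" becomes ".{n}" or ".{a,b}" (any residues); every other
    element is taken to be a literal one-letter amino acid code.
    """
    parts = []
    for element in pattern.split("-"):
        if element.startswith("x(") and element.endswith(")"):
            parts.append(".{" + element[2:-1] + "}")
        else:
            parts.append(element)
    return "".join(parts)


ring_regex = re.compile(spacing_to_regex(RING_SPACING))

# Made-up toy sequence that uses the minimum allowed spacing at every position.
toy_ring = ("C" + "AA" + "C" + "A" * 9 + "C" + "A" + "H" + "AA"
            + "C" + "AA" + "C" + "A" * 4 + "C" + "AA" + "C")

print(ring_regex.pattern)
print(ring_regex.search(toy_ring) is not None)
```

The same helper would accept the C2H2 spacing written as "C-x(2)-C-x(12)-H-x(3)-H", which is one reason to keep the conversion separate from any particular motif.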

GATA-type transcription factors contain one or two zinc finger domains which bind
specifically to a region of DNA that contains the consecutive nucleotide sequence GATA. NMR
studies indicate that the zinc finger comprises two irregular anti-parallel β sheets and an α helix,
followed by a long loop to the C-terminal end of the finger (Omichinski, J.G. et al. (1993) Science
261:438-446). The helix and the loop connecting the two β sheets contact the major groove of the
DNA, while the C-terminal part, which determines the specificity of binding, wraps around into the 
minor groove. 
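
For illustration only, candidate GATA sites in a DNA sequence can be located by searching for the tetranucleotide on both strands. The Python sketch below is not part of the original disclosure. The function names and the toy sequence are hypothetical, and the presence of the sequence GATA alone does not establish binding by any particular factor.

```python
# Watson-Crick complement table for an uppercase DNA alphabet.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


def find_gata_sites(dna):
    """Return sorted (position, strand) pairs for every GATA occurrence.

    Positions are 0-based on the forward strand. A "-" entry means the
    forward strand reads TATC, i.e. GATA lies on the opposite strand.
    """
    dna = dna.upper()
    sites = []
    for motif, strand in (("GATA", "+"), (reverse_complement("GATA"), "-")):
        start = dna.find(motif)
        while start != -1:
            sites.append((start, strand))
            start = dna.find(motif, start + 1)  # allow overlapping hits
    return sorted(sites)


toy_dna = "CCGATAAGGTATCAA"  # made-up sequence
print(find_gata_sites(toy_dna))
```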

The LIM motif consists of about 60 amino acid residues and contains seven conserved
cysteine residues and a histidine within a consensus sequence (Schmeichel, K.L. and Beckerle, M.C.
(1994) Cell 79:211-219). The LIM family includes transcription factors and cytoskeletal proteins
which may be involved in development, differentiation, and cell growth. One example is actin-binding
LIM protein, which may play roles in regulation of the cytoskeleton and cellular morphogenesis (Roof,
D.J. et al. (1997) J. Cell Biol. 138:575-588). The N-terminal domain of actin-binding LIM protein has
four double zinc finger motifs with the LIM consensus sequence. The C-terminal domain of
actin-binding LIM protein shows sequence similarity to known actin-binding proteins such as dematin and
villin. Actin-binding LIM protein binds to F-actin through its dematin-like C-terminal domain. The 
LIM domain may mediate protein-protein interactions with other LIM-binding proteins. 

Myeloid cell development is controlled by tissue-specific transcription factors. Myeloid zinc
finger proteins (MZF) include MZF-1 and MZF-2. MZF-1 functions in regulation of the development
of neutrophilic granulocytes. A murine homolog MZF-2 is expressed in myeloid cells, particularly in 
the cells committed to the neutrophilic lineage. MZF-2 is down-regulated by G-CSF and appears to 
have a unique function in neutrophil development (Murai, K. et al. (1997) Genes Cells 2:581-591). 

The leucine zipper motif comprises a stretch of amino acids rich in leucine which can form an
amphipathic α helix. This structure provides the basis for dimerization of two leucine zipper proteins.
The region adjacent to the leucine zipper is usually basic, and upon protein dimerization, is optimally
positioned for binding to the major groove. Proteins containing such motifs are generally referred to as
bZIP transcription factors. The leucine zipper motif is found in the proto-oncogenes Fos and Jun,
which comprise the heterodimeric transcription factor AP1 involved in cell growth and the
determination of cell lineage (Papavassiliou, A.G. (1995) N. Engl. J. Med. 332:45-47).

The helix-loop-helix motif (HLH) consists of a short α helix connected by a loop to a longer α
helix. The loop is flexible and allows the two helices to fold back against each other and to bind to
DNA. The transcription factor Myc contains a prototypical HLH motif.

The NF-kappa-B/Rel signature defines a family of eukaryotic transcription factors involved in
oncogenesis, embryonic development, differentiation and immune response. Most transcription factors
containing the Rel homology domain (RHD) bind as dimers to a consensus DNA sequence motif
termed kappa-B. Members of the Rel family share a highly conserved 300 amino acid domain termed
the Rel homology domain. The characteristic Rel C-terminal domain is involved in gene activation and
cytoplasmic anchoring functions. Proteins known to contain the RHD domain include vertebrate 
nuclear factor NF-kappa-B, which is a heterodimer of a DNA-binding subunit and the transcription 
factor p65, mammalian transcription factor RelB, and vertebrate proto-oncogene c-rel, a protein 
associated with differentiation and lymphopoiesis (Kabrun, N. and Enrietto, P.J. (1994) Semin. Cancer
Biol. 5:103-112).

A DNA binding motif termed ARID (AT-rich interactive domain) distinguishes an 
evolutionarily conserved family of proteins. The approximately 100-residue ARID sequence is present 
in a series of proteins strongly implicated in the regulation of cell growth, development, and 
tissue-specific gene expression. ARID proteins include Bright (a regulator of B-cell-specific gene
expression), dead ringer (involved in development), and MRF-2 (which represses expression from the
cytomegalovirus enhancer) (Dallas, P.B. et al. (2000) Mol. Cell Biol. 20:3137-3146).

The ELM2 (Egl-27 and MTA1 homology 2) domain is found in metastasis-associated protein 
MTA1 and protein ER1. The Caenorhabditis elegans gene egl-27 is required for embryonic
patterning. MTA1, a human gene with elevated expression in metastatic carcinomas, is a component
of a protein complex with histone deacetylase and nucleosome remodelling activities (Solari, F. et al.
(1999) Development 126:2483-2494). The ELM2 domain is usually found to the N terminus of a
myb-like DNA binding domain. ELM2 is also found associated with an ARID DNA binding domain.

The Iroquois (Irx) family of genes is found in nematodes, insects and vertebrates. Irx genes
usually occur in one or two genomic clusters of three genes each and encode transcriptional
controllers that possess a characteristic homeodomain. The Irx genes function early in development to
specify the identity of diverse territories of the body. Later in development in both Drosophila and 
vertebrates, the Irx genes function again to subdivide those territories into smaller domains. (For a 
review of Iroquois genes, see Cavodeassi, F. et al. (2001) Development 128:2847-2855.) For 
example, mouse and human Irx4 proteins are 83% conserved and their 63-aa homeodomain is more
than 93% identical to that of the Drosophila Iroquois patterning genes. Irx4 transcripts are
predominantly expressed in the cardiac ventricles. The homeobox gene Irx4 mediates ventricular
differentiation during cardiac development (Bruneau, B.G. et al. (2000) Dev. Biol. 217:266-77).

Histidine triad (HIT) proteins share residues in distinctive dimeric, 10-stranded half-barrel
structures that form two identical purine nucleotide-binding sites. Hint (histidine triad
nucleotide-binding protein)-related proteins, found in all forms of life, and fragile histidine triad
(Fhit)-related proteins, found in animals and fungi, represent the two main branches of the HIT
superfamily. Fhit homologs bind and cleave diadenosine polyphosphates. Fhit-Ap(n)A complexes
appear to function in a proapoptotic tumor suppression pathway in epithelial tissues (Brenner C. et al.
(1999) J. Cell. Physiol. 181:179-187).

Most transcription factors contain characteristic DNA binding motifs, and variations on the 
above motifs and new motifs have been and are currently being characterized. (Faisst, S. and S. 
Meyer (1992) Nucleic Acids Res. 20:3-26.) 

Chromatin Associated Proteins

In the nucleus, DNA is packaged into chromatin, the compact organization of which limits the 
accessibility of DNA to transcription factors and plays a key role in gene regulation. (Lewin, supra, 
pp. 409-410.) The compact structure of chromatin is determined and influenced by
chromatin-associated proteins such as the histones, the high mobility group (HMG) proteins, and the
chromodomain proteins. There are five classes of histones, H1, H2A, H2B, H3, and H4, all of which
are highly basic, low molecular weight proteins. The fundamental unit of chromatin, the nucleosome,
consists of 200 base pairs of DNA associated with two copies each of H2A, H2B, H3, and H4. H1
links adjacent nucleosomes. HMG proteins are low molecular weight, non-histone proteins that may
play a role in unwinding DNA and stabilizing single-stranded DNA. Chromodomain proteins play a
key role in the formation of highly compacted heterochromatin, which is transcriptionally silent.

Diseases and Disorders Related to Gene Regulation

Many neoplastic disorders in humans can be attributed to inappropriate gene expression. 
Malignant cell growth may result from either excessive expression of tumor promoting genes or 
insufficient expression of tumor suppressor genes. (Cleary, M.L. (1992) Cancer Surv. 15:89-104.)
The zinc finger-type transcriptional regulator WT1 is a tumor-suppressor protein that is inactivated in
children with Wilms' tumor. The oncogene bcl-6, which plays an important role in large-cell
lymphoma, is also a zinc-finger protein (Papavassiliou, A.G. (1995) N. Engl. J. Med. 332:45-47). 
Chromosomal translocations may also produce chimeric loci that fuse the coding sequence of one 
gene with the regulatory regions of a second unrelated gene. Such an arrangement likely results in
inappropriate gene transcription, potentially contributing to malignancy. In Burkitt's lymphoma, for
example, the transcription factor Myc is translocated to the immunoglobulin heavy chain locus, greatly
enhancing Myc expression and resulting in rapid cell growth leading to leukemia (Latchman, D.S.
(1996) N. Engl. J. Med. 334:28-33).

In addition, the immune system responds to infection or trauma by activating a cascade of
events that coordinate the progressive selection, amplification, and mobilization of cellular defense 
mechanisms. A complex and balanced program of gene activation and repression is involved in this 
process. However, hyperactivity of the immune system as a result of improper or insufficient
regulation of gene expression may result in considerable tissue or organ damage. This damage is
well-documented in immunological responses associated with arthritis, allergens, heart attack, stroke, and
infections. (Isselbacher et al. Harrison's Principles of Internal Medicine, 13/e, McGraw-Hill, Inc. and
Teton Data Systems Software, 1996.) The causative gene for autoimmune
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED) was recently isolated and found to encode a protein
with two PHD-type zinc finger motifs (Bjorses, P. et al. (1998) Hum. Mol. Genet. 7:1547-1553).

Furthermore, the generation of multicellular organisms is based upon the induction and 
coordination of cell differentiation at the appropriate stages of development. Central to this process is 
differential gene expression, which confers the distinct identities of cells and tissues throughout the
body. Failure to regulate gene expression during development could result in developmental disorders.
Human developmental disorders caused by mutations in zinc finger-type transcriptional regulators
include: urogenital developmental abnormalities associated with WT1; Greig cephalopolysyndactyly,
Pallister-Hall syndrome, and postaxial polydactyly type A (GLI3); and Townes-Brocks syndrome,
characterized by anal, renal, limb, and ear abnormalities (SALL1) (Engelkamp, D. and V. van
Heyningen (1996) Curr. Opin. Genet. Dev. 6:334-342; Kohlhase, J. et al. (1999) Am. J. Hum. Genet.
64:435-445).

Human acute leukemias involve reciprocal chromosome translocations that fuse the ALL-1 
gene located at chromosome region 11q23 to a series of partner genes positioned on a variety of
human chromosomes. The fused genes encode chimeric proteins. The AF17 gene encodes a protein
of 1093 amino acids, containing a leucine-zipper dimerization motif located 3' of the fusion point and a
cysteine-rich domain at the N terminus that shows homology to a domain within the protein Br140
(peregrin) (Prasad R. et al. (1994) Proc. Natl. Acad. Sci. USA 91:8107-8111).

SYNTHESIS OF NUCLEIC ACIDS
Polymerases 

DNA and RNA replication are critical processes for cell replication and function. DNA and 
RNA replication are mediated by the enzymes DNA and RNA polymerase, respectively, by a
"templating" process in which the nucleotide sequence of a DNA or RNA strand is copied by
complementary base-pairing into a complementary nucleic acid sequence of either DNA or RNA.
However, there are fundamental differences between the two processes.

DNA polymerase catalyzes the stepwise addition of a deoxyribonucleotide to the 3'-OH end
of a polynucleotide strand (the primer strand) that is paired to a second (template) strand. The new 
DNA strand therefore grows in the 5' to 3' direction (Alberts, B. et al. (1994) The Molecular Biology
of the Cell, Garland Publishing Inc., New York, NY, pp. 251-254). The substrates for the
polymerization reaction are the corresponding deoxynucleotide triphosphates which must base-pair
with the correct nucleotide on the template strand in order to be recognized by the polymerase. 
Because DNA exists as a double-stranded helix, each of the two strands may serve as a template for 
the formation of a new complementary strand. Each of the two daughter cells of a dividing cell 
therefore inherits a new DNA double helix containing one old and one new strand. Thus, DNA is said
to be replicated "semiconservatively" by DNA polymerase. In addition to the synthesis of new DNA,
DNA polymerase is also involved in the repair of damaged DNA as discussed below under "Ligases." 
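
The semiconservative outcome described above can be sketched in a few lines of Python. This example is illustrative only and is not part of the original disclosure. The function names are hypothetical, both strands are written 5' to 3', and the model ignores primers, proofreading, and all enzymology.

```python
# Watson-Crick pairing rules for DNA.
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def synthesize_complement(template):
    """Return the new strand (5' to 3') templated by `template` (5' to 3').

    The template is read from its 3' end, so the new strand grows 5' to 3'.
    """
    return "".join(PAIR[base] for base in reversed(template))


def replicate(duplex):
    """Return two daughter duplexes, each as an (old_strand, new_strand) pair.

    `duplex` is (strand, complementary_strand), both written 5' to 3'.
    """
    top, bottom = duplex
    return [(top, synthesize_complement(top)),
            (bottom, synthesize_complement(bottom))]


parent = ("ATGC", "GCAT")  # made-up four-base duplex
for old, new in replicate(parent):
    print("old:", old, "new:", new)
```

Each daughter duplex keeps exactly one parental strand, and each newly made strand has the same sequence as the parental strand it replaces.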

In contrast to DNA polymerase, RNA polymerase uses a DNA template strand to 
"transcribe" DNA into RNA using ribonucleotide triphosphates as substrates. Like DNA 
polymerization, RNA polymerization proceeds in a 5' to 3' direction by addition of a ribonucleoside
monophosphate to the 3'-OH end of a growing RNA chain. DNA transcription generates messenger
RNAs (mRNA) that carry information for protein synthesis, as well as the transfer, ribosomal, and 
other RNAs that have structural or catalytic functions. In eukaryotes, three discrete RNA 
polymerases synthesize the three different types of RNA (Alberts, supra, pp. 367-368). RNA 
polymerase I makes the large ribosomal RNAs, RNA polymerase II makes the mRNAs that will be
translated into proteins, and RNA polymerase III makes a variety of small, stable RNAs, including 5S
ribosomal RNA and the transfer RNAs (tRNA). In all cases, RNA synthesis is initiated by binding of 
the RNA polymerase to a promoter region on the DNA and synthesis begins at a start site within the 
promoter. Synthesis is completed at a stop (termination) signal in the DNA whereupon both the 
polymerase and the completed RNA chain are released. 
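
As a minimal illustration of the templating step in transcription, the Python sketch below builds an RNA chain from a DNA template strand by complementary base-pairing. It is not part of the original disclosure. The function name transcribe is hypothetical, the template is assumed to be written 3' to 5' so that the RNA is produced 5' to 3', and promoter recognition and termination signals are not modeled.

```python
# Pairing of each DNA template base with the ribonucleotide added opposite it.
RNA_PAIR = {"A": "U", "T": "A", "G": "C", "C": "G"}


def transcribe(template_3_to_5):
    """Return the RNA (5' to 3') templated by a DNA strand written 3' to 5'.

    Each loop iteration mimics addition of one ribonucleotide to the 3' end
    of the growing chain.
    """
    rna = []
    for base in template_3_to_5.upper():
        rna.append(RNA_PAIR[base])
    return "".join(rna)


print(transcribe("TACGGT"))
```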

Ligases

DNA repair is the process by which accidental base changes, such as those produced by 
oxidative damage, hydrolytic attack, or uncontrolled methylation of DNA, are corrected before 
replication or transcription of the DNA can occur. Because of the efficiency of the DNA repair 
process, fewer than one in a thousand accidental base changes causes a mutation (Alberts, supra, pp.
245-249). The three steps common to most types of DNA repair are (1) excision of the damaged or
altered base or nucleotide by DNA nucleases, (2) insertion of the correct nucleotide in the gap left by 
the excised nucleotide by DNA polymerase using the complementary strand as the template and, (3) 
sealing the break left between the inserted nucleotide(s) and the existing DNA strand by DNA ligase.
In the last reaction, DNA ligase uses the energy from ATP hydrolysis to activate the 5' end of the
broken phosphodiester bond before forming the new bond with the 3'-OH of the DNA strand. In
Bloom's syndrome, an inherited human disease, individuals are partially deficient in DNA ligation and
consequently have an increased incidence of cancer (Alberts, supra, p. 247).

Nucleases

Nucleases comprise enzymes that hydrolyze both DNA (DNase) and RNA (RNase). They
serve different purposes in nucleic acid metabolism. Nucleases hydrolyze the phosphodiester bonds
between adjacent nucleotides either at internal positions (endonucleases) or at the terminal 3' or 5'
nucleotide positions (exonucleases). A DNA exonuclease activity in DNA polymerase, for example,
serves to remove improperly paired nucleotides attached to the 3'-OH end of the growing DNA strand
by the polymerase and thereby serves a "proofreading" function. As mentioned above, DNA 
endonuclease activity is involved in the excision step of the DNA repair process. 

RNases also serve a variety of functions. For example, RNase P is a ribonucleoprotein 
enzyme which cleaves the 5' end of pre-tRNAs as part of their maturation process. RNase H digests
the RNA strand of an RNA/DNA hybrid. Such hybrids occur in cells invaded by retroviruses, and
RNase H is an important enzyme in the retroviral replication cycle. Pancreatic RNase secreted by 
the pancreas into the intestine hydrolyzes RNA present in ingested foods. RNase activity in serum 
and cell extracts is elevated in a variety of cancers and infectious diseases (Schein, C.H. (1997) Nat. 
Biotechnol. 15:529-536). Regulation of RNase activity is being investigated as a means to control
tumor angiogenesis, allergic reactions, viral infection and replication, and fungal infections.

MODIFICATION OF NUCLEIC ACIDS
Methylases

Methylation of specific nucleotides occurs in both DNA and RNA, and serves different 
functions in the two macromolecules. Methylation of cytosine residues to form 5-methylcytosine in
DNA occurs specifically in CG sequences which are base-paired with one another in the DNA
double-helix. The pattern of methylation is passed from generation to generation during DNA 
replication by an enzyme called "maintenance methylase" that acts preferentially on those CG 
sequences that are base-paired with a CG sequence that is already methylated. Such methylation 
appears to distinguish active from inactive genes by preventing the binding of regulatory proteins that
"turn on" the gene, but permitting the binding of proteins that inactivate the gene (Alberts, supra, pp.
448-451). In RNA metabolism, "tRNA methylase" produces one of several nucleotide modifications
in tRNA that affect the conformation and base-pairing of the molecule and facilitate the recognition of
the appropriate mRNA codons by specific tRNAs. The primary methylation pattern is the
dimethylation of guanine residues to form N,N-dimethyl guanine.
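
The inheritance of a CG methylation pattern by maintenance methylation, as described above for DNA, can be sketched as follows. This Python example is illustrative only and is not part of the original disclosure. The function names and the coordinate convention (0-based positions on the parental strand) are assumptions, and the model treats every hemimethylated CG site as being copied with complete fidelity.

```python
def cpg_positions(strand):
    """Return 0-based positions of the C in every CG dinucleotide (5' to 3')."""
    strand = strand.upper()
    return [i for i in range(len(strand) - 1) if strand[i:i + 2] == "CG"]


def maintenance_methylate(parent_strand, parent_methylated):
    """Return the cytosines methylated on the newly made complementary strand.

    `parent_methylated` holds positions of methylated C's on the parental
    strand. A CG at position i pairs with a CG on the new strand whose C lies
    opposite the parental G at i + 1, so each methylated parental site i marks
    position i + 1 (in parental-strand coordinates) on the new strand.
    """
    return {i + 1 for i in cpg_positions(parent_strand) if i in parent_methylated}


toy_strand = "ACGTTCGA"           # made-up sequence with CG sites at 1 and 5
print(cpg_positions(toy_strand))
print(maintenance_methylate(toy_strand, {1}))
```

In this toy case only the first CG site is methylated on the parental strand, so only the cytosine paired with that site is methylated on the new strand, and the unmethylated site stays unmethylated.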

Helicases and Single-stranded Binding Proteins 

Helicases are enzymes that destabilize and unwind double helix structures in both DNA and
RNA. Since DNA replication occurs more or less simultaneously on both strands, the two strands
must first separate to generate a replication "fork" for DNA polymerase to act on. Two types of
replication proteins contribute to this process, DNA helicases and single-stranded binding proteins.
DNA helicases hydrolyze ATP and use the energy of hydrolysis to separate the DNA strands.
Single-stranded binding proteins (SSBs) then bind to the exposed DNA strands, without covering the
bases, thereby temporarily stabilizing them for templating by the DNA polymerase (Alberts, supra, pp.
255-256).

RNA helicases also alter and regulate RNA conformation and secondary structure. Like the 
DNA helicases, RNA helicases utilize energy derived from ATP hydrolysis to destabilize and unwind 
RNA duplexes. The most well-characterized and ubiquitous family of RNA helicases is the
DEAD-box family, so named for the conserved B-type ATP-binding motif which is diagnostic of proteins in
this family. Over 40 DEAD-box helicases have been identified in organisms as diverse as bacteria,
insects, yeast, amphibians, mammals, and plants. DEAD-box helicases function in diverse processes
such as translation initiation, splicing, ribosome assembly, and RNA editing, transport, and stability.
Examples of these RNA helicases include yeast Drs1 protein, which is involved in ribosomal RNA
processing; yeast TIF1 and TIF2 and mammalian eIF-4A, which are essential to the initiation of RNA
translation; and human p68 antigen, which regulates cell growth and division (Ripmaster, T.L. et al.
(1992) Proc. Natl. Acad. Sci. USA 89:11131-11135; Chang, T.-H. et al. (1990) Proc. Natl. Acad. Sci. 
USA 87:1571-1575). These RNA helicases demonstrate strong sequence homology over a stretch of 
some 420 amino acids. Included among these conserved sequences are the consensus sequence for 
the A motif of an ATP binding protein; the "DEAD box" sequence, associated with ATPase activity;
the sequence SAT, associated with the actual helicase unwinding region; and an octapeptide
consensus sequence, required for RNA binding and ATP hydrolysis (Pause, A. et al. (1993) Mol. Cell
Biol. 13:6789-6798). Differences outside of these conserved regions are believed to reflect 
differences in the functional roles of individual proteins (Chang, T.H. et al. (1990) Proc. Natl. Acad. 
Sci. USA 87:1571-1575). 

Some DEAD-box helicases play tissue- and stage-specific roles in spermatogenesis and
embryogenesis. Overexpression of the DEAD-box 1 protein (DDX1) may play a role in the
progression of neuroblastoma (Nb) and retinoblastoma (Rb) tumors (Godbout, R. et al. (1998) J. Biol.
Chem. 273:21161-21168). These observations suggest that DDX1 may promote or enhance tumor
progression by altering the normal secondary structure and expression levels of RNA in cancer cells.
Other DEAD-box helicases have been implicated either directly or indirectly in tumorigenesis.
(Discussed in Godbout, supra.) For example, murine p68 is mutated in ultraviolet light-induced
tumors, and human DDX6 is located at a chromosomal breakpoint associated with B-cell lymphoma.
Similarly, a chimeric protein composed of DDX10 and NUP98, a nucleoporin protein, may be involved
in the pathogenesis of certain myeloid malignancies.

Topoisomerases

Besides the need to separate DNA strands prior to replication, the two strands must be
"unwound" from one another prior to their separation by DNA helicases. This function is performed
by proteins known as DNA topoisomerases. DNA topoisomerase effectively acts as a reversible
nuclease that hydrolyzes a phosphodiester bond in a DNA strand, permits the two strands to rotate
freely about one another to remove the strain of the helix, and then rejoins the original phosphodiester
bond between the two strands. Topoisomerases are essential enzymes responsible for the topological
rearrangement of DNA brought about by transcription, replication, chromatin formation,
recombination, and chromosome segregation. Superhelical coils are introduced into DNA by the
passage of processive enzymes such as RNA polymerase, or by the separation of DNA strands by a 
helicase prior to replication. Knotting and concatenation can occur in the process of DNA synthesis, 
storage, and repair. All topoisomerases work by breaking a phosphodiester bond in the ribose- 
phosphate backbone of DNA. A catalytic tyrosine residue on the enzyme makes a nucleophilic attack 

20 on the scissile phosphodiester bond, resulting in a reaction intermediate in which a covalent bond is 
formed between the enzyme and one end of the broken strand. A tyrosine-DNA phosphodiesterase 
functions in DNA repair by hydrolyzing this bond in occasional dead-end topoisomerase I-DNA 
intermediates (Pouliot, JJ. et al. (1999) Science 286:552-555). 

Two types of DNA topoisomerase exist, types I and II. Type I topoisomerases work as
monomers, making a break in a single strand of DNA, while type II topoisomerases, working as
homodimers, cleave both strands. DNA topoisomerase I causes a single-strand break in a DNA
helix to allow the rotation of the two strands of the helix about the remaining phosphodiester bond in
the opposite strand. DNA topoisomerase II causes a transient break in both strands of a DNA helix
where two double helices cross over one another. This type of topoisomerase can efficiently separate
two interlocked DNA circles (Alberts, supra, pp. 260-262). Type II topoisomerases are largely
confined to proliferating cells in eukaryotes, such as cancer cells. For this reason they are targets for
anticancer drugs. Topoisomerase II has been implicated in multi-drug resistance (MDR) as it appears
to aid in the repair of DNA damage inflicted by DNA binding agents such as doxorubicin and
vincristine.

The topoisomerase I family includes topoisomerases I and III (topo I and topo III). The
crystal structure of human topoisomerase I suggests that rotation about the intact DNA strand is
partially controlled by the enzyme. In this "controlled rotation" model, protein-DNA interactions limit
the rotation, which is driven by torsional strain in the DNA (Stewart, L. et al. (1998) Science
279:1534-1541). Structurally, topo I can be recognized by its catalytic tyrosine residue and a number
of other conserved residues in the active site region. Topo I is thought to function during transcription.
Two topo IIIs are known in humans, and they are homologous to prokaryotic topoisomerase I, with a
conserved tyrosine and active site signature specific to this family. Topo III has been suggested to
play a role in meiotic recombination. A mouse topo III is highly expressed in testis tissue and its
expression increases with the increase in the number of cells in pachytene (Seki, T. et al. (1998) J.
Biol. Chem. 273:28553-28556).

The topoisomerase II family includes two isozymes (IIα and IIβ) encoded by different genes.
Topo II cleaves double stranded DNA in a reproducible, nonrandom fashion, preferentially in an AT
rich region, but the basis of cleavage site selectivity is not known. Structurally, topo II is made up of
four domains, the first two of which are structurally similar and probably distantly homologous to
similar domains in eukaryotic topo I. The second domain bears the catalytic tyrosine, as well as a
highly conserved pentapeptide. The IIα isoform appears to be responsible for unlinking DNA during
chromosome segregation. Cell lines expressing IIα but not IIβ suggest that IIβ is dispensable in
cellular processes; however, IIβ knockout mice died perinatally due to a failure in neural development.
That the major abnormalities occurred in predominantly late developmental events (neurogenesis)
suggests that IIβ is needed not at mitosis, but rather during DNA repair (Yang, X. et al. (2000)
Science 287:131-134).

Topoisomerases have been implicated in a number of disease states, and topoisomerase
poisons have proven to be effective anti-tumor drugs for some human malignancies. Topo I is
mislocalized in Fanconi's anemia, and may be involved in the chromosomal breakage seen in this
disorder (Wunder, E. (1984) Hum. Genet. 68:276-281). Overexpression of a truncated topo III in
ataxia-telangiectasia (A-T) cells partially suppresses the A-T phenotype, probably through a dominant
negative mechanism. This suggests that topo III is deregulated in A-T (Fritz, E. et al. (1997) Proc.
Natl. Acad. Sci. USA 94:4538-4542). Topo III also interacts with the Bloom's Syndrome gene
product, and has been suggested to have a role as a tumor suppressor (Wu, L. et al. (2000) J. Biol.
Chem. 275:9636-9644). Aberrant topo II activity is often associated with cancer or increased cancer
risk. Greatly lowered topo II activity has been found in some, but not all, A-T cell lines (Mohamed, R.
et al. (1987) Biochem. Biophys. Res. Commun. 149:233-238). On the other hand, topo II can break
DNA in the region of the A-T gene (ATM), which controls all DNA damage-responsive cell cycle
checkpoints (Kaufmann, W.K. (1998) Proc. Soc. Exp. Biol. Med. 217:327-334). The ability of
topoisomerases to break DNA has been used as the basis of antitumor drugs. Topoisomerase poisons
act by increasing the number of dead-end covalent DNA-enzyme complexes in the cell, ultimately
triggering cell death pathways (Fortune, J.M. and N. Osheroff (2000) Prog. Nucleic Acid Res. Mol.
Biol. 64:221-253; Guichard, S.M. and M.K. Danks (1999) Curr. Opin. Oncol. 11:482-489). Antibodies
against topo I are found in the serum of systemic sclerosis patients, and the levels of the antibody may
be used as a marker of pulmonary involvement in the disease (Diot, E. et al. (1999) Chest
116:715-720). Finally, the DNA binding region of human topo I has been used as a DNA delivery
vehicle for gene therapy (Chen, T.Y. et al. (2000) Appl. Microbiol. Biotechnol. 53:558-567).
Recombinases 

Genetic recombination is the process of rearranging DNA sequences within an organism's
genome to provide genetic variation for the organism in response to changes in the environment.
DNA recombination allows variation in the particular combination of genes present in an individual's
genome, as well as the timing and level of expression of these genes. (See Alberts, supra, pp.
263-273.) Two broad classes of genetic recombination are commonly recognized, general recombination
and site-specific recombination. General recombination involves genetic exchange between any
homologous pair of DNA sequences usually located on two copies of the same chromosome. The
process is aided by enzymes, recombinases, that "nick" one strand of a DNA duplex more or less
randomly and permit exchange with a complementary strand on another duplex. The process does not
normally change the arrangement of genes in a chromosome. In site-specific recombination, the
recombinase recognizes specific nucleotide sequences present in one or both of the recombining
molecules. Base-pairing is not involved in this form of recombination and therefore it does not require
DNA homology between the recombining molecules. Unlike general recombination, this form of
recombination can alter the relative positions of nucleotide sequences in chromosomes.
RNA METABOLISM 

Ribonucleic acid (RNA) is a linear single-stranded polymer of four nucleotides, ATP, CTP,
UTP, and GTP. In most organisms, RNA is transcribed as a copy of deoxyribonucleic acid (DNA),
the genetic material of the organism. In retroviruses, RNA rather than DNA serves as the genetic
material. RNA copies of the genetic material encode proteins or serve various structural, catalytic, or
regulatory roles in organisms. RNA is classified according to its cellular localization and function.
Messenger RNAs (mRNAs) encode polypeptides. Ribosomal RNAs (rRNAs) are assembled, along
with ribosomal proteins, into ribosomes, which are cytoplasmic particles that translate mRNA into
polypeptides. Transfer RNAs (tRNAs) are cytosolic adaptor molecules that function in mRNA
translation by recognizing both an mRNA codon and the amino acid that matches that codon.
Heterogeneous nuclear RNAs (hnRNAs) include mRNA precursors and other nuclear RNAs of
various sizes. Small nuclear RNAs (snRNAs) are a part of the nuclear spliceosome complex that
removes intervening, non-coding sequences (introns) and rejoins exons in pre-mRNAs.

Proteins are associated with RNA during its transcription from DNA, RNA processing, and 
translation of mRNA into protein. Proteins are also associated with RNA as it is used for structural, 
catalytic, and regulatory purposes. 

RNA Processing

Ribosomal RNAs (rRNAs) are assembled, along with ribosomal proteins, into ribosomes,
which are cytoplasmic particles that translate messenger RNA (mRNA) into polypeptides. The
eukaryotic ribosome is composed of a 60S (large) subunit and a 40S (small) subunit, which together
form the 80S ribosome. In addition to the 18S, 28S, 5S, and 5.8S rRNAs, ribosomes contain from 50
to over 80 different ribosomal proteins, depending on the organism. Ribosomal proteins are classified
according to the subunit to which they belong (i.e., L if associated with the large 60S subunit or S if
associated with the small 40S subunit). E. coli ribosomes have been the most thoroughly studied and
contain 50 proteins, many of which are conserved in all life forms. The structures of nine ribosomal
proteins have been solved to less than 3.0 Å resolution (i.e., S5, S6, S17, L1, L6, L9, L12, L14, L30),
revealing common motifs, such as β-α-β protein folds in addition to acidic and basic RNA-binding
motifs positioned between β-strands. Most ribosomal proteins are believed to contact rRNA directly
(reviewed in Liljas, A. and Garber, M. (1995) Curr. Opin. Struct. Biol. 5:721-727; see also Woodson,
S.A. and Leontis, N.B. (1998) Curr. Opin. Struct. Biol. 8:294-300; Ramakrishnan, V. and White, S.W.
(1998) Trends Biochem. Sci. 23:208-212).

Ribosomal proteins may undergo post-translational modifications or interact with other
ribosome-associated proteins to regulate translation. For example, the highly homologous 40S
ribosomal protein S6 kinases (S6K1 and S6K2) play a key role in the regulation of cell growth by
controlling the biosynthesis of translational components which make up the protein synthetic apparatus
(including the ribosomal proteins). In the case of S6K1, at least eight phosphorylation sites are
believed to mediate kinase activation in a hierarchical fashion (Dufner and Thomas (1999) Exp. Cell.
Res. 253:100-109). Some of the ribosomal proteins, including L1, also function as translational
repressors by binding to polycistronic mRNAs encoding ribosomal proteins (reviewed in Liljas, supra
and Garber, supra).

Recent evidence suggests that a number of ribosomal proteins have secondary functions
independent of their involvement in protein biosynthesis. These proteins function as regulators of cell
proliferation and, in some instances, as inducers of cell death. For example, the expression of human
ribosomal protein L13a has been shown to induce apoptosis by arresting cell growth in the G2/M
phase of the cell cycle. Inhibition of expression of L13a induces apoptosis in target cells, which
suggests that this protein is necessary, in the appropriate amount, for cell survival. Similar results have
been obtained in yeast where inactivation of yeast homologues of L13a, rp22 and rp23, results in
severe growth retardation and death. A closely related ribosomal protein, L7, arrests cells in G1 and
also induces apoptosis. Thus, it appears that a subset of ribosomal proteins may function as cell cycle
checkpoints and compose a new family of cell proliferation regulators.

Mapping of individual ribosomal proteins on the surface of intact ribosomes is accomplished
using 3D immunocryoelectronmicroscopy, whereby antibodies raised against specific ribosomal
proteins are visualized. Progress has been made toward the mapping of L1, L7, and L12, while the
structure of the intact ribosome has been solved to only 20-25 Å resolution and inconsistencies exist
among different crude structures (Frank, J. (1997) Curr. Opin. Struct. Biol. 7:266-272).

Three distinct sites have been identified on the ribosome. The aminoacyl-tRNA acceptor site
(A site) receives charged tRNAs (with the exception of the initiator-tRNA). The peptidyl-tRNA site
(P site) binds the nascent polypeptide as the amino acid from the A site is added to the elongating
chain. Deacylated tRNAs bind in the exit site (E site) prior to their release from the ribosome. The
structure of the ribosome is reviewed in Stryer, L. (1995) Biochemistry, W.H. Freeman and Company,
New York NY, pp. 888-9081; Lodish, H. et al. (1995) Molecular Cell Biology, Scientific American
Books, New York NY, pp. 119-138; and Lewin, B. (1997) Genes VI, Oxford University Press, Inc.,
New York NY.

Various proteins are necessary for processing of transcribed RNAs in the nucleus.
Pre-mRNA processing steps include capping at the 5' end with methylguanosine, polyadenylating the 3'
end, and splicing to remove introns. The primary RNA transcript from DNA is a faithful copy of the
gene containing both exon and intron sequences, and the latter sequences must be cut out of the RNA
transcript to produce an mRNA that codes for a protein. This "splicing" of the mRNA sequence takes
place in the nucleus with the aid of a large, multicomponent ribonucleoprotein complex known as a
spliceosome. The spliceosomal complex is comprised of five small nuclear ribonucleoprotein particles
(snRNPs) designated U1, U2, U4, U5, and U6. Each snRNP contains a single species of snRNA and
about ten proteins. The RNA components of some snRNPs recognize and base-pair with intron
consensus sequences. The protein components mediate spliceosome assembly and the splicing
reaction. Autoantibodies to snRNP proteins are found in the blood of patients with systemic lupus
erythematosus (Stryer, L. (1995) Biochemistry, W.H. Freeman and Company, New York NY, p.
863).
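
The excision of introns described above can be illustrated with a minimal sketch. It is not a model of the spliceosome: the function name splice, the toy sequence, and the use of 0-based, half-open intron coordinates are all assumptions made for illustration, and the intron positions are supplied by the caller rather than recognized from consensus sequences.

```python
def splice(pre_mrna: str, introns: list[tuple[int, int]]) -> str:
    """Join the exons of a pre-mRNA by removing the given introns.

    `introns` holds hypothetical (start, end) pairs, 0-based and half-open,
    so pre_mrna[start:end] is the sequence that gets cut out.
    """
    exons = []
    position = 0  # first nucleotide not yet assigned to an exon or intron
    for start, end in sorted(introns):
        if start < position or start >= end or end > len(pre_mrna):
            raise ValueError(f"invalid or overlapping intron: {(start, end)}")
        exons.append(pre_mrna[position:start])  # exon preceding this intron
        position = end                          # skip over the intron
    exons.append(pre_mrna[position:])           # final exon
    return "".join(exons)


# Toy transcript (illustrative only): exon "AUG", one intron, exon "GCU".
toy_pre_mrna = "AUG" + "GUAAGUCCAG" + "GCU"
mature = splice(toy_pre_mrna, [(3, 13)])
print(mature)
```

Sorting the introns and tracking a single running position keeps the sketch linear in the number of introns and makes overlapping coordinates easy to reject.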

Several splicing regulatory proteins have been identified in Drosophila. Human (HsSWAP)
and mouse (MmSWAP) homologs of the suppressor-of-white-apricot (su(wa)) gene have been cloned
and characterized. HsSWAP and MmSWAP both have five regions highly homologous to su(wa),
including an arginine/serine-rich domain and two repeated modules that are homologous to regions in
the constitutive splicing factor, SPP91/PRP21. Mammalian SWAP mRNAs are alternatively spliced
at the same splice sites as in Drosophila. The splice junctions of the Drosophila and mammalian
regulated introns are conserved. Thus, research suggests that the mammalian SWAP gene functions
as a vertebrate alternative splicing regulator (Denhez, F. and Lafyatis, R. (1994) J. Biol. Chem.
269:16170-16179).

Serine- and arginine-rich pre-mRNA splicing factors (SR proteins) are phosphorylated before
they regulate splicing events. SRrp86 (SR-related protein of 86 kDa) is a novel SR protein containing
a single amino-terminal RNA recognition motif and two carboxy-terminal domains rich in
serine-arginine (SR) dipeptides. SRrp86 activates splicing in the presence of SRp20. However, it inhibits
the in vitro and in vivo activation of specific splice sites by SR proteins, including ASF/SF2, SC35, and
SRp55. Research suggests that pairwise combination of SRrp86 with specific SR proteins leads to
altered splicing efficiency and differential splice site selection (Barnard, D.C. and Patton, J.G. (2000)
Mol. Cell. Biol. 20:3049-3057).

Heterogeneous nuclear ribonucleoproteins (hnRNPs) have been identified that have roles in
splicing, exporting of the mature RNAs to the cytoplasm, and mRNA translation (Biamonti, G. et al.
(1998) Clin. Exp. Rheumatol. 16:317-326). Some examples of hnRNPs include the yeast proteins
Hrp1p, involved in cleavage and polyadenylation at the 3' end of the RNA; Cbp80p, involved in
capping the 5' end of the RNA; and Npl3p, a homolog of mammalian hnRNP A1, involved in export of
mRNA from the nucleus (Shen, E.C. et al. (1998) Genes Dev. 12:679-691). HnRNPs have been
shown to be important targets of the autoimmune response in rheumatic diseases (Biamonti, supra).

Many snRNP and hnRNP proteins are characterized by an RNA recognition motif (RRM).
(Reviewed in Birney, E. et al. (1993) Nucleic Acids Res. 21:5803-5816.) The RRM is about 80 amino
acids in length and forms four β-strands and two α-helices arranged in an α/β sandwich. The RRM
contains a core RNP-1 octapeptide motif along with surrounding conserved sequences. In addition to
snRNP proteins, examples of RNA-binding proteins which contain the above motifs include
heterogeneous nuclear ribonucleoproteins which stabilize nascent RNA and factors which regulate
alternative splicing. Alternative splicing factors include developmentally regulated proteins, specific
examples of which have been identified in lower eukaryotes such as Drosophila melanogaster and
Caenorhabditis elegans. These proteins play key roles in developmental processes such as pattern
formation and sex determination, respectively. (See, for example, Hodgkin, J. et al. (1994)
Development 120:3681-3689.)

The 3' ends of most eukaryote mRNAs are also posttranscriptionally modified by
polyadenylation. Polyadenylation proceeds through two enzymatically distinct steps: (i) the
endonucleolytic cleavage of nascent mRNAs at cis-acting polyadenylation signals in the
3'-untranslated (non-coding) region and (ii) the addition of a poly(A) tract to the 5' mRNA fragment.
The presence of cis-acting RNA sequences is necessary for both steps. These sequences include
5'-AAUAAA-3' located 10-30 nucleotides upstream of the cleavage site and a less well-conserved GU-
or U-rich sequence element located 10-30 nucleotides downstream of the cleavage site. Cleavage
stimulation factor (CstF), cleavage factor I (CF I), and cleavage factor II (CF II) are involved in the
cleavage reaction while cleavage and polyadenylation specificity factor (CPSF) and poly(A)
polymerase (PAP) are necessary for both cleavage and polyadenylation. An additional factor,
poly(A)-binding protein II (PAB II), promotes poly(A) tract elongation (Rüegsegger, U. et al. (1996)
J. Biol. Chem. 271:6107-6113; and references within).
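
The positional relationship between the AAUAAA signal and the cleavage site can be illustrated with a short sketch. It assumes, for illustration only, that the 10-30 nucleotide distance is measured from the 3' end of the hexamer; the function name find_polya_sites and the toy sequence are hypothetical, and the downstream GU- or U-rich element is not scored.

```python
import re


def find_polya_sites(rna: str, min_gap: int = 10, max_gap: int = 30):
    """Locate AAUAAA hexamers and the window where cleavage would be expected.

    Returns a list of (signal_start, window_start, window_end) tuples using
    0-based, half-open coordinates. The window spans min_gap..max_gap
    nucleotides past the 3' end of the hexamer (an assumed convention).
    """
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    sites = []
    # A lookahead pattern reports overlapping hexamers too.
    for match in re.finditer(r"(?=AAUAAA)", rna):
        signal_end = match.start() + 6
        window_start = signal_end + min_gap
        window_end = min(signal_end + max_gap, len(rna))
        if window_start <= len(rna):  # skip signals too close to the 3' end
            sites.append((match.start(), window_start, window_end))
    return sites


# Toy 3'-UTR (illustrative only): signal, spacer, then a GU-rich stretch.
toy_utr = "GGCAUC" + "AAUAAA" + "C" * 20 + "GUGUUUGU"
for signal_start, window_start, window_end in find_polya_sites(toy_utr):
    print(signal_start, window_start, window_end)
```

A real predictor would also weigh the downstream element and hexamer variants; this sketch only encodes the spacing rule stated in the text.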

TRANSLATION 

Correct translation of the genetic code depends upon each amino acid forming a linkage with
the appropriate transfer RNA (tRNA). The aminoacyl-tRNA synthetases (aaRSs) are essential
proteins found in all living organisms. The aaRSs are responsible for the activation and correct
attachment of an amino acid to its cognate tRNA, as the first step in protein biosynthesis.
Prokaryotic organisms have at least twenty different types of aaRSs, one for each different amino
acid, while eukaryotes usually have two aaRSs, a cytosolic form and a mitochondrial form, for each
different amino acid. The 20 aaRS enzymes can be divided into two structural classes. Class I
enzymes add amino acids to the 2' hydroxyl at the 3' end of tRNAs while Class II enzymes add amino
acids to the 3' hydroxyl at the 3' end of tRNAs. Each class is characterized by a distinctive topology
of the catalytic domain. Class I enzymes contain a catalytic domain based on the nucleotide-binding
Rossmann 'fold'. In particular, a consensus tetrapeptide motif is highly conserved (Prosite Document
PDOC00161, Aminoacyl-transfer RNA synthetases class-I signature). Class I enzymes are specific
for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan,
and valine. Class II enzymes contain a central catalytic domain, which consists of a seven-stranded
antiparallel β-sheet domain, as well as N- and C-terminal regulatory domains. Class II enzymes are
separated into two groups based on the heterodimeric or homodimeric structure of the enzyme; the
latter group is further subdivided by the structure of the N- and C-terminal regulatory domains
(Hartlein, M. and Cusack, S. (1995) J. Mol. Evol. 40:519-530). Class II enzymes are specific for
alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine.
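
The class assignments listed above can be captured in a small lookup table. The sketch below encodes only what the text states (the amino acid specificities of each class and the hydroxyl to which each class attaches the amino acid); the names CLASS_I, CLASS_II, and aars_class, and the use of three-letter amino acid codes, are illustrative assumptions.

```python
# Amino acid specificities of the two aaRS classes, as listed in the text
# (three-letter codes are used here for brevity).
CLASS_I = {"Arg", "Cys", "Glu", "Gln", "Ile", "Leu", "Met", "Tyr", "Trp", "Val"}
CLASS_II = {"Ala", "Asn", "Asp", "Gly", "His", "Lys", "Phe", "Pro", "Ser", "Thr"}


def aars_class(amino_acid: str) -> tuple[str, str]:
    """Return (class, hydroxyl acylated at the tRNA 3' end) for an amino acid."""
    if amino_acid in CLASS_I:
        return ("I", "2'")
    if amino_acid in CLASS_II:
        return ("II", "3'")
    raise KeyError(f"not one of the 20 standard amino acids: {amino_acid}")


for residue in ("Ile", "Ser"):
    print(residue, aars_class(residue))
```

Keeping the two sets separate makes it easy to check that they are disjoint and together cover all twenty amino acids.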
Certain aaRSs also have editing functions. IleRS, for example, can misactivate valine to form
Val-tRNA(Ile), but this product is cleared by a hydrolytic activity that destroys the mischarged product.
This editing activity is located within a second catalytic site found in the connective polypeptide 1
region (CP1), a long insertion sequence within the Rossmann fold domain of Class I enzymes
(Schimmel, P. et al. (1998) FASEB J. 12:1599-1609). AaRSs also play a role in tRNA processing. It
has been shown that mature tRNAs are charged with their respective amino acids in the nucleus
before export to the cytoplasm, and charging may serve as a quality control mechanism to ensure the
tRNAs are functional (Martinis, S.A. et al. (1999) EMBO J. 18:4591-4596).

Under optimal conditions, polypeptide synthesis proceeds at a rate of approximately 40 amino
acid residues per second. The rate of misincorporation during translation is on the order of 10^-4 and is
primarily the result of aminoacyl-tRNAs being charged with the incorrect amino acid. Incorrectly
charged tRNAs are toxic to cells as they result in the incorporation of incorrect amino acid residues
into an elongating polypeptide. The rate of translation is presumed to be a compromise between the
optimal rate of elongation and the need for translational fidelity. Mathematical calculations predict that
10^-4 is indeed the maximum acceptable error rate for protein synthesis in a biological system (reviewed
in Stryer, supra; and Watson, J. et al. (1987) The Benjamin/Cummings Publishing Co., Inc. Menlo
Park, CA). A particularly error prone aminoacyl-tRNA charging event is the charging of tRNA(Gln)
with Gln. A mechanism exists for the correction of this mischarging event which likely has its origins in
evolution. Gln was among the last of the 20 naturally occurring amino acids used in polypeptide
synthesis to appear in nature. Gram positive eubacteria, cyanobacteria, Archaea, and eukaryotic
organelles possess a noncanonical pathway for the synthesis of Gln-tRNA(Gln) based on the
transformation of Glu-tRNA(Gln) (synthesized by Glu-tRNA synthetase, GluRS) using the enzyme
Glu-tRNA(Gln) amidotransferase (Glu-AdT). The reactions involved in the transamidation pathway are as
follows (Curnow, A.W. et al. (1997) Nucleic Acids Symposium 36:2-4):

                              GluRS
    tRNA(Gln) + Glu + ATP  --------->  Glu-tRNA(Gln) + AMP + PPi

                                 Glu-AdT
    Glu-tRNA(Gln) + Gln + ATP  --------->  Gln-tRNA(Gln) + Glu + ADP + Pi

A similar enzyme, Asp-tRNA(Asn) amidotransferase, exists in Archaea, which transforms
Asp-tRNA(Asn) to Asn-tRNA(Asn). Formylase, the enzyme that transforms Met-tRNA(fMet) to
fMet-tRNA(fMet) in eubacteria, is likely to be a related enzyme. A hydrolytic activity has also been
identified that destroys mischarged Val-tRNA(Ile) (Schimmel, P. et al. (1998) FASEB J. 12:1599-1609).
One likely scenario for the evolution of Glu-AdT in primitive life forms is the absence of a specific
glutaminyl-tRNA synthetase (GlnRS), requiring an alternative pathway for the synthesis of
Gln-tRNA(Gln). In fact, deletion of the Glu-AdT operon in Gram positive bacteria is lethal (Curnow,
A.W. et al. (1997) Proc. Natl. Acad. Sci. USA 94:11819-11826). The existence of GluRS activity in
other organisms has been inferred by the high degree of conservation in translation machinery in
nature; however, GluRS has not been identified in all organisms, including Homo sapiens. Such an
enzyme would be responsible for ensuring translational fidelity and reducing the synthesis of
defective polypeptides.
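
The elongation rate (about 40 residues per second) and misincorporation rate (about 10^-4 per residue) quoted above can be turned into a rough worked calculation. The sketch below assumes that errors at different positions are independent, which is a simplification introduced here and not a claim from the cited references; the function names and the example length of 300 residues are illustrative.

```python
def error_free_fraction(length: int, error_rate: float = 1e-4) -> float:
    """Fraction of chains of `length` residues made without any error.

    Assumes each residue is misincorporated independently with probability
    `error_rate`, so the chance of no error is (1 - error_rate) ** length.
    """
    return (1.0 - error_rate) ** length


def synthesis_time_seconds(length: int, residues_per_second: float = 40.0) -> float:
    """Time to synthesize a chain at a constant elongation rate."""
    return length / residues_per_second


# Illustrative polypeptide of 300 residues (an arbitrary example length).
length = 300
print(f"error-free fraction: {error_free_fraction(length):.4f}")
print(f"synthesis time: {synthesis_time_seconds(length):.1f} s")
```

Under these assumptions most chains of ordinary length are error free at 10^-4, whereas raising the error rate by an order of magnitude or two sharply reduces that fraction, which is one way to read the compromise between speed and fidelity described in the text.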

In addition to their function in protein synthesis, specific aminoacyl tRNA synthetases also
play roles in cellular fidelity, RNA splicing, RNA trafficking, apoptosis, and transcriptional and
translational regulation. For example, human tyrosyl-tRNA synthetase can be proteolytically cleaved
into two fragments with distinct cytokine activities. The carboxy-terminal domain exhibits monocyte
and leukocyte chemotaxis activity as well as stimulating production of myeloperoxidase, tumor
necrosis factor-α, and tissue factor. The N-terminal domain binds to the interleukin-8 type A receptor
and functions as an interleukin-8-like cytokine. Human tyrosyl-tRNA synthetase is secreted from
apoptotic tumor cells and may accelerate apoptosis (Wakasugi, K., and Schimmel, P. (1999) Science
284:147-151). Mitochondrial Neurospora crassa TyrRS and S. cerevisiae LeuRS are essential
factors for certain group I intron splicing activities, and human mitochondrial LeuRS can substitute for
the yeast LeuRS in a yeast null strain. Certain bacterial aaRSs are involved in regulating their own
transcription or translation (Martinis, supra). Several aaRSs are able to synthesize diadenosine
oligophosphates, a class of signalling molecules with roles in cell proliferation, differentiation, and
apoptosis (Kisselev, L.L. et al. (1998) FEBS Lett. 427:157-163; Vartanian, A. et al. (1999) FEBS Lett.
456:175-180).

Autoantibodies against aminoacyl-tRNA synthetases are generated by patients with autoimmune
diseases such as rheumatic arthritis, dermatomyositis and polymyositis, and correlate strongly with
complicating interstitial lung disease (ILD) (Freist, W. et al. (1999) Biol. Chem. 380:623-646; Freist,
W. et al. (1996) Biol. Chem. Hoppe Seyler 377:343-356). These antibodies appear to be generated in
response to viral infection, and coxsackie virus has been used to induce experimental viral myositis
in animals.

Comparison of aaRS structures between humans and pathogens has been useful in the design
of novel antibiotics (Schimmel, supra). Genetically engineered aaRSs have been utilized to allow
site-specific incorporation of unnatural amino acids into proteins in vivo (Liu, D.R. et al. (1997) Proc.
Natl. Acad. Sci. USA 94:10092-10097).
tRNA Modifications 

The modified ribonucleotide, pseudouridine (Ψ), is present ubiquitously in the anticodon regions
of transfer RNAs (tRNAs), large and small ribosomal RNAs (rRNAs), and small nuclear RNAs
(snRNAs). Ψ is the most common of the modified nucleosides (i.e., other than G, A, U, and C) present
in tRNAs. Only a few yeast tRNAs that are not involved in protein synthesis do not contain Ψ
(Cortese, R. et al. (1974) J. Biol. Chem. 249:1103-1108). The enzyme responsible for the conversion
of uridine to Ψ, pseudouridine synthase (pseudouridylate synthase), was first isolated from Salmonella
typhimurium (Arena, F. et al. (1978) Nucleic Acids Res. 5:4523-4536). The enzyme has since been
isolated from a number of mammals, including steer and mice (Green, C.J. et al. (1982) J. Biol. Chem.
257:3045-52; and Chen, J. and Patton, J.R. (1999) RNA 5:409-419). tRNA pseudouridine synthases
have been the most extensively studied members of the family. They require a thiol donor (e.g.,
cysteine) and a monovalent cation (e.g., ammonium or potassium) for optimal activity. Additional
cofactors or high energy molecules (e.g., ATP or GTP) are not required (Green, supra). Other
eukaryotic pseudouridine synthases have been identified that appear to be specific for rRNA
(reviewed in Smith, C.M. and Steitz, J.A. (1997) Cell 89:669-672) and a dual-specificity enzyme has
been identified that uses both tRNA and rRNA substrates (Wrzesinski, J. et al. (1995) RNA
1:437-448). The absence of Ψ in the anticodon loop of tRNAs results in reduced growth in both
bacteria (Singer, C.E. et al. (1972) Nature New Biol. 238:72-74) and yeast (Lecointe, F. (1998) J.
Biol. Chem. 273:1316-1323), although the genetic defect is not lethal.

Another ribonucleoside modification that occurs primarily in eukaryotic cells is the conversion
of guanosine to N2,N2-dimethylguanosine (m2,2G) at position 26 or 10 at the base of the D-stem of
cytosolic and mitochondrial tRNAs. This posttranscriptional modification is believed to stabilize tRNA
structure by preventing the formation of alternative tRNA secondary and tertiary structures. Yeast
tRNA^ is unusual in that it does not contain this modification. The modification does not occur in
eubacteria, presumably because the structure of tRNAs in these cells and organelles is sequence
constrained and does not require posttranscriptional modification to prevent the formation of
alternative structures (Steinberg, S. and Cedergren, R. (1995) RNA 1:886-891, and references within).
The enzyme responsible for the conversion of guanosine to m2,2G is a 63 kDa S-adenosylmethionine
(SAM)-dependent tRNA N2,N2-dimethylguanosine methyltransferase (also referred to as the TRM1
gene product and herein referred to as TRM) (Edqvist, J. (1995) Biochimie 77:54-61). The enzyme
localizes to both the nucleus and the mitochondria (Li, J-M. et al. (1989) J. Cell Biol. 109:1411-1419).
Based on studies with TRM from Xenopus laevis, there appears to be a requirement for base pairing
at positions C11-G24 and G10-C25 immediately preceding the G26 to be modified, with other
structural features of the tRNA also being required for the proper presentation of the G26 substrate
(Edqvist, J. et al. (1992) Nucleic Acids Res. 20:6575-6581). Studies in yeast suggest that cells
carrying a weak ochre tRNA suppressor (sup3-i) are unable to suppress translation termination in the
absence of TRM activity, suggesting a role for TRM in modifying the frequency of suppression in
eukaryotic cells (Niederberger, C. et al. (1999) FEBS Lett. 464:67-70), in addition to the more general
function of ensuring the proper three-dimensional structures for tRNA.

Translation Initiation

Initiation of translation can be divided into three stages. The first stage brings an initiator
transfer RNA (Met-tRNAf) together with the 40S ribosomal subunit to form the 43S preinitiation
complex. The second stage binds the 43S preinitiation complex to the mRNA, followed by migration
of the complex to the correct AUG initiation codon. The third stage brings the 60S ribosomal subunit
to the 40S subunit to generate an 80S ribosome at the initiation codon. Regulation of translation
primarily involves the first and second stages in the initiation process (V.M. Pain (1996) Eur. J.
Biochem. 236:747-771).

Several initiation factors, many of which contain multiple subunits, are involved in bringing an
initiator tRNA and the 40S ribosomal subunit together. eIF2, a guanine nucleotide binding protein,
recruits the initiator tRNA to the 40S ribosomal subunit. Only when eIF2 is bound to GTP does it
associate with the initiator tRNA. eIF2B, a guanine nucleotide exchange protein, is responsible for
converting eIF2 from the GDP-bound inactive form to the GTP-bound active form. Two other
factors, eIF1A and eIF3, bind and stabilize the 40S subunit by interacting with the 18S ribosomal RNA
and specific ribosomal structural proteins. eIF3 is also involved in association of the 40S ribosomal
subunit with mRNA. The Met-tRNAf, eIF1A, eIF3, and 40S ribosomal subunit together make up the
43S preinitiation complex (Pain, supra).

Additional factors are required for binding of the 43S preinitiation complex to an mRNA
molecule, and the process is regulated at several levels. eIF4F is a complex consisting of three
proteins: eIF4E, eIF4A, and eIF4G. eIF4E recognizes and binds to the mRNA 5'-terminal m7GTP
cap, eIF4A is a bidirectional RNA-dependent helicase, and eIF4G is a scaffolding polypeptide. eIF4G
has three binding domains. The N-terminal third of eIF4G interacts with eIF4E, the central third
interacts with eIF4A, and the C-terminal third interacts with eIF3 bound to the 43S preinitiation
complex. Thus, eIF4G acts as a bridge between the 40S ribosomal subunit and the mRNA (M.W.
Hentze (1997) Science 275:500-501).

The ability of eIF4F to initiate binding of the 43S preinitiation complex is regulated by 
structural features of the mRNA. The mRNA molecule has an untranslated region (UTR) between 
the 5' cap and the AUG start codon. In some mRNAs this region forms secondary structures that 
impede binding of the 43S preinitiation complex. The helicase activity of eIF4A is thought to function
in removing this secondary structure to facilitate binding of the 43S preinitiation complex (Pain, 
supra). 

Translation Elongation 

Elongation is the process whereby additional amino acids are joined to the initiator methionine
to form the complete polypeptide chain. The elongation factors EF1α, EF1βγ, and EF2 are involved
in elongating the polypeptide chain following initiation. EF1α is a GTP-binding protein. In its
GTP-bound form, EF1α brings an aminoacyl-tRNA to the ribosome's A site. The amino acid attached to
the newly arrived aminoacyl-tRNA forms a peptide bond with the initiator methionine. The GTP on
EF1α is hydrolyzed to GDP, and EF1α-GDP dissociates from the ribosome. EF1βγ binds EF1α-GDP
and induces the dissociation of GDP from EF1α, allowing EF1α to bind GTP and a new cycle
to begin.

As subsequent aminoacyl-tRNAs are brought to the ribosome, EF-G, another GTP-binding
protein, catalyzes the translocation of tRNAs from the A site to the P site and finally to the E site of
the ribosome. This allows the ribosome and the mRNA to remain attached during translation.
Translation Termination

The release factor eRF carries out termination of translation. eRF recognizes stop codons in 
the mRNA, leading to the release of the polypeptide chain from the ribosome. 
Expression profiling 

Microarrays are analytical tools used in bioanalysis. A microarray has a plurality of molecules 
spatially distributed over, and stably associated with, the surface of a solid support. Microarrays of
polypeptides, polynucleotides, and/or antibodies have been developed and find use in a variety of 
applications, such as gene sequencing, monitoring gene expression, gene mapping, bacterial 
identification, drug discovery, and combinatorial chemistry. 

One area in particular in which microarrays find use is in gene expression analysis. Array 
technology can provide a simple way to explore the expression of a single polymorphic gene or the
expression profile of a large number of related or unrelated genes. When the expression of a single 
gene is examined, arrays are employed to detect the expression of a specific gene or its variants. 
When an expression profile is examined, arrays provide a platform for identifying genes that are tissue
specific, are affected by a substance being tested in a toxicology assay, are part of a signaling
cascade, carry out housekeeping functions, or are specifically related to a particular genetic
predisposition, condition, disease, or disorder. 
RNA Expression 

Array technology can provide a simple way to explore the expression of a single polymorphic
gene or the expression profile of a large number of related or unrelated genes. When the expression
of a single gene is examined, arrays are employed to detect the expression of a specific gene or its
variants. When an expression profile is examined, arrays provide a platform for identifying genes
that are tissue specific, carry out housekeeping functions, are part of a signaling cascade, or are
specifically related to a particular genetic predisposition, condition, disease, or disorder.

The potential application of gene expression profiling is particularly relevant to measuring the
toxic response to potential therapeutic compounds and the metabolic response to therapeutic agents.
Diseases treated with steroids and disorders caused by the metabolic response to treatment with
steroids include adenomatosis, cholestasis, cirrhosis, hemangioma, Henoch-Schonlein purpura,
hepatitis, hepatocellular and metastatic carcinomas, idiopathic thrombocytopenic purpura, porphyria,
sarcoidosis, and Wilson disease. Response may be measured by comparing both the levels and
sequences expressed in tissues from subjects exposed to or treated with steroid compounds such as
mifepristone, progesterone, beclomethasone, medroxyprogesterone, budesonide, prednisone,
dexamethasone, betamethasone, or danazol with the levels and sequences expressed in normal
untreated tissue.
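
The comparison of expression levels between treated and untreated tissue can be sketched in Python
as follows. This is a minimal illustration only: it covers expression levels, not sequences, and it
assumes that a log2 fold change with a pseudocount of 1 and a threshold of 1.0 (a two-fold change)
is a suitable measure. The function names, gene names, and values are hypothetical.

```python
import math


def log2_fold_change(treated: float, untreated: float, pseudocount: float = 1.0) -> float:
    """Log2 ratio of treated to untreated signal; the pseudocount avoids division by zero."""
    return math.log2((treated + pseudocount) / (untreated + pseudocount))


def differentially_expressed(treated: dict, untreated: dict, threshold: float = 1.0) -> dict:
    """Return genes whose absolute log2 fold change meets the (assumed) threshold."""
    result = {}
    for gene in treated.keys() & untreated.keys():  # only genes measured in both samples
        lfc = log2_fold_change(treated[gene], untreated[gene])
        if abs(lfc) >= threshold:
            result[gene] = lfc
    return result


# Hypothetical array signals for three genes
treated_levels = {"geneA": 7.0, "geneB": 3.0, "geneC": 0.0}
untreated_levels = {"geneA": 1.0, "geneB": 3.0, "geneC": 3.0}
print(differentially_expressed(treated_levels, untreated_levels))
```

In this hypothetical data set, geneA is reported as increased and geneC as decreased in treated
tissue, while geneB is unchanged and is not reported.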

Steroids are a class of lipid-soluble molecules, including cholesterol, bile acids, vitamin D, and
hormones, that share a common four-ring structure based on cyclopentanoperhydrophenanthrene and
that carry out a wide variety of functions. Cholesterol, for example, is a component of cell
membranes that controls membrane fluidity. It is also a precursor for bile acids which solubilize lipids
and facilitate absorption in the small intestine during digestion. Vitamin D regulates the absorption of
calcium in the small intestine and controls the concentration of calcium in plasma. Steroid hormones,
produced by the adrenal cortex, ovaries, and testes, include glucocorticoids, mineralocorticoids,
androgens, and estrogens. They control various biological processes by binding to intracellular
receptors that regulate transcription of specific genes in the nucleus. Glucocorticoids, for example,
increase blood glucose concentrations by regulation of gluconeogenesis in the liver, increase blood
concentrations of fatty acids by promoting lipolysis in adipose tissues, modulate sensitivity to
catecholamines in the central nervous system, and reduce inflammation. The principal
mineralocorticoid, aldosterone, is produced by the adrenal cortex and acts on cells of the distal tubules
of the kidney to enhance sodium ion reabsorption. Androgens, produced by the interstitial cells of
Leydig in the testis, include the male sex hormone testosterone, which triggers changes at puberty, the
production of sperm and maintenance of secondary sexual characteristics. Female sex hormones,
estrogen and progesterone, are produced by the ovaries and also by the placenta and adrenal cortex of
the fetus during pregnancy. Estrogen regulates female reproductive processes and secondary sexual
characteristics. Progesterone regulates changes in the endometrium during the menstrual cycle and
pregnancy.

Steroid hormones are widely used for fertility control and in anti-inflammatory treatments for
physical injuries and diseases such as arthritis, asthma, and auto-immune disorders. Progesterone, a
naturally occurring progestin, is primarily used to treat amenorrhea or abnormal uterine bleeding, or as a
contraceptive. Endogenous progesterone is responsible for inducing secretory activity in the
endometrium of the estrogen-primed uterus in preparation for the implantation of a fertilized egg and
for the maintenance of pregnancy. It is secreted from the corpus luteum in response to luteinizing
hormone (LH). The primary contraceptive effect of exogenous progestins involves the suppression
of the midcycle surge of LH. At the cellular level, progestins diffuse freely into target cells and bind
to the progesterone receptor. Target cells include the female reproductive tract, the mammary gland,
the hypothalamus, and the pituitary. Once bound to the receptor, progestins slow the frequency of
release of gonadotropin releasing hormone from the hypothalamus and blunt the pre-ovulatory LH
surge, thereby preventing follicular maturation and ovulation. Progesterone has minimal estrogenic
and androgenic activity. Progesterone is metabolized hepatically to pregnanediol and conjugated with
glucuronic acid.

Medroxyprogesterone (MAH), also known as 6α-methyl-17-hydroxyprogesterone, is a
synthetic progestin with a pharmacological activity about 15 times greater than progesterone. MAH is
used for the treatment of renal and endometrial carcinomas, amenorrhea, abnormal uterine bleeding,
and endometriosis associated with hormonal imbalance. MAH has a stimulatory effect on respiratory
centers and has been used in cases of low blood oxygenation caused by sleep apnea, chronic
obstructive pulmonary disease, or hypercapnia.

Mifepristone, also known as RU-486, is an antiprogesterone drug that blocks receptors of
progesterone. It counteracts the effects of progesterone, which is needed to sustain pregnancy.
Mifepristone induces spontaneous abortion when administered in early pregnancy followed by
treatment with the prostaglandin, misoprostol. Further, studies show that mifepristone at a
substantially lower dose can be highly effective as a postcoital contraceptive when administered within
five days after unprotected intercourse, thus providing women with a "morning-after pill" in case of
contraceptive failure or sexual assault. Mifepristone also has potential uses in the treatment of breast
and ovarian cancers in cases in which tumors are progesterone-dependent. It interferes with
steroid-dependent growth of brain meningiomas, and may be useful in treatment of endometriosis where it
blocks the estrogen-dependent growth of endometrial tissues. It may also be useful in treatment of
uterine fibroid tumors and Cushing's Syndrome. Mifepristone binds to glucocorticoid receptors and
interferes with cortisol binding. Mifepristone also may act as an anti-glucocorticoid and be effective
for treating conditions where cortisol levels are elevated such as AIDS, anorexia nervosa, ulcers,
diabetes, Parkinson's disease, multiple sclerosis, and Alzheimer's disease.

Danazol is a synthetic steroid derived from ethinyl testosterone. Danazol indirectly reduces
estrogen production by lowering pituitary synthesis of follicle-stimulating hormone and LH. Danazol
also binds to sex hormone receptors in target tissues, thereby exhibiting anabolic, antiestrogenic, and
weakly androgenic activity. Danazol does not possess any progestogenic activity, and does not
suppress normal pituitary release of corticotropin or release of cortisol by the adrenal glands. Danazol
is used in the treatment of endometriosis to relieve pain and inhibit endometrial cell growth. It is also
used to treat fibrocystic breast disease and hereditary angioedema.

Corticosteroids are used to relieve inflammation and to suppress the immune response. They
inhibit eosinophil, basophil, and airway epithelial cell function by regulation of cytokines that mediate
the inflammatory response. They inhibit leukocyte infiltration at the site of inflammation, interfere in
the function of mediators of the inflammatory response, and suppress the humoral immune response.
Corticosteroids are used to treat allergies, asthma, arthritis, and skin conditions. Beclomethasone is a
synthetic glucocorticoid that is used to treat steroid-dependent asthma, to relieve symptoms associated
with allergic or nonallergic (vasomotor) rhinitis, or to prevent recurrent nasal polyps following surgical
removal. The anti-inflammatory and vasoconstrictive effects of intranasal beclomethasone are 5000
times greater than those produced by hydrocortisone. Budesonide is a corticosteroid used to control
symptoms associated with allergic rhinitis or asthma. Budesonide has high topical anti-inflammatory
activity but low systemic activity. Dexamethasone is a synthetic glucocorticoid used in
anti-inflammatory or immunosuppressive compositions. It is also used in inhalants to prevent symptoms of
asthma. Due to its greater ability to reach the central nervous system, dexamethasone is usually the
treatment of choice to control cerebral edema. Dexamethasone is approximately 20-30 times more
potent than hydrocortisone and 5-7 times more potent than prednisone. Prednisone is metabolized in
the liver to its active form, prednisolone, a glucocorticoid with anti-inflammatory properties.
Prednisone is approximately 4 times more potent than hydrocortisone and the duration of action of
prednisone is intermediate between hydrocortisone and dexamethasone. Prednisone is used to treat
allograft rejection, asthma, systemic lupus erythematosus, arthritis, ulcerative colitis, and other
inflammatory conditions. Betamethasone is a synthetic glucocorticoid with anti-inflammatory and
immunosuppressive activity and is used to treat psoriasis and fungal infections, such as athlete's foot
and ringworm.
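
The relative potencies quoted above can be cross-checked with simple arithmetic, sketched here in
Python. The sketch assumes that the potencies are multiplicative relative to hydrocortisone (set to
1) and represents each as a (low, high) range; the table and function names are hypothetical, and
the calculation is illustrative only.

```python
# Approximate potency relative to hydrocortisone (= 1), as (low, high) ranges
# taken from the figures quoted in the text.
RELATIVE_POTENCY = {
    "hydrocortisone": (1.0, 1.0),
    "prednisone": (4.0, 4.0),
    "dexamethasone": (20.0, 30.0),
}


def potency_ratio(drug: str, reference: str) -> tuple:
    """Return the (low, high) range of the potency of `drug` relative to `reference`."""
    low_drug, high_drug = RELATIVE_POTENCY[drug]
    low_ref, high_ref = RELATIVE_POTENCY[reference]
    return (low_drug / high_ref, high_drug / low_ref)


print(potency_ratio("dexamethasone", "prednisone"))  # (5.0, 7.5)
```

Under these assumptions, the derived range of 5 to 7.5 is in line with the stated 5-7 fold
potency of dexamethasone relative to prednisone.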

The anti-inflammatory actions of corticosteroids are thought to involve phospholipase A2
inhibitory proteins, collectively called lipocortins. Lipocortins, in turn, control the biosynthesis of potent
mediators of inflammation such as prostaglandins and leukotrienes by inhibiting the release of the
precursor molecule arachidonic acid. Proposed mechanisms of action include decreased IgE
synthesis, increased number of β-adrenergic receptors on leukocytes, and decreased arachidonic acid
metabolism. During an immediate allergic reaction, such as in chronic bronchial asthma, allergens
bridge the IgE antibodies on the surface of mast cells, which triggers these cells to release
chemotactic substances. Mast cell influx and activation, therefore, is partially responsible for the
inflammation and hyperirritability of the oral mucosa in asthmatic patients. This inflammation can be
retarded by administration of corticosteroids.
CA3+ Steroids 

Array technology can provide a simple way to explore the expression of a single polymorphic
gene or the expression profile of a large number of related or unrelated genes. When the expression
of a single gene is examined, arrays are employed to detect the expression of a specific gene or its
variants. When an expression profile is examined, arrays provide a platform for identifying genes
that are tissue specific, carry out housekeeping functions, are part of a signaling cascade, or are
specifically related to a particular genetic predisposition, condition, disease, or disorder.

The potential application of gene expression profiling is particularly relevant to measuring the
toxic response to potential therapeutic compounds and the metabolic response to therapeutic agents.
Diseases treated with steroids and disorders caused by the metabolic response to treatment with
steroids include adenomatosis, cholestasis, cirrhosis, hemangioma, Henoch-Schonlein purpura,
hepatitis, hepatocellular and metastatic carcinomas, idiopathic thrombocytopenic purpura, porphyria,
sarcoidosis, and Wilson disease. Response may be measured by comparing both the levels and
sequences expressed in tissues from subjects exposed to or treated with steroid compounds such as
mifepristone, progesterone, beclomethasone, medroxyprogesterone, budesonide, prednisone,
dexamethasone, betamethasone, or danazol with the levels and sequences expressed in normal
untreated tissue.

The effects upon liver metabolism and hormone clearance mechanisms are important to
understanding the pharmacodynamics of a drug. The human C3A cell line is a clonal derivative of
HepG2/C3 (hepatoma cell line, isolated from a 15-year-old male with liver tumor), which was selected
for strong contact inhibition of growth. The use of a clonal population enhances the reproducibility of
the cells. C3A cells have many characteristics of primary human hepatocytes in culture: i) expression
of insulin receptor and insulin-like growth factor II receptor; ii) secretion of a high ratio of serum
albumin compared with α-fetoprotein; iii) conversion of ammonia to urea and glutamine; iv)
metabolism of aromatic amino acids; and v) proliferation in glucose-free and insulin-free medium. The
C3A cell line is now well established as an in vitro model of the mature human liver (Mickelson et al.
(1995) Hepatology 22:866-875; Nagendra et al. (1997) Am J Physiol 272:G408-G416).
Vascular endothelial activation 

Activation of vascular endothelium is a central event in a wide range of physiological and
disease processes such as vascular tone regulation, coagulation and thrombosis, atherosclerosis,
inflammation and some infectious diseases. Blood vessel walls are composed of two tissue layers: an
endothelial cell (EC) layer which comprises the lumenal surface of the vessel, and an underlying
vascular smooth muscle cell (VSMC) layer. Through dynamic interactions with each other and with
surrounding tissues, the vascular endothelium and smooth muscle tissues maintain vascular tone,
control selective permeability of the vascular wall, direct vessel remodeling and angiogenesis, and
modulate inflammatory and immune responses.

The inflammatory response is a complex vascular reaction mediated by numerous cytokines,
chemokines, growth factors, and other signaling molecules expressed by activated ECs, VSMCs and
leukocytes. Inflammation protects the organism during trauma and infection, but can also lead to
pathological conditions such as atherosclerosis.

Human coronary artery endothelial cells (HCAECs) are primary cells derived from the
endothelium of a human coronary artery. HCAECs are used as an experimental model for
investigating the role of the endothelium in human vascular biology in vitro. Human umbilical artery
endothelial cells (HUAECs) are primary cells derived from the endothelium of an umbilical artery.
Human uterine myometrium microvascular endothelial cells (UtMVECs) are primary cells derived
from the uterine myometrium microvasculature. Human iliac artery endothelial cells (HIAECs) are
primary cells derived from the endothelium of an iliac artery. Human umbilical vein endothelial cells
(HUVECs) are a primary cell line derived from the endothelium of the human umbilical vein.
ECV304 is a human endothelial line.
Breast cancer 

There are more than 180,000 new cases of breast cancer diagnosed each year, and the
mortality rate for breast cancer approaches 10% of all deaths in females between the ages of 45 and 54
(Gish, K. (1999) AWIS Magazine 28:7-10). However, the survival rate based on early diagnosis of
localized breast cancer is extremely high (97%), compared with the advanced stage of the disease in
which the tumor has spread beyond the breast (22%). Current procedures for clinical breast
examination are lacking in sensitivity and specificity, and efforts are underway to develop
comprehensive gene expression profiles for breast cancer that may be used in conjunction with
conventional screening methods to improve diagnosis and prognosis of this disease (Perou, C.M. et al.
(2000) Nature 406:747-752). Mutations in two genes, BRCA1 and BRCA2, are known to greatly
predispose a woman to breast cancer and may be passed on from parents to children (Gish, supra).
However, this type of hereditary breast cancer accounts for only about 5% to 9% of breast cancers,
while the vast majority of breast cancer is due to non-inherited mutations that occur in breast epithelial
cells.

Cell lines derived from human mammary epithelial cells at various stages of breast cancer
provide a useful model to study the process of malignant transformation and tumor progression, as it
has been shown that these cell lines retain many of the properties of their parental tumors for lengthy
culture periods (Wistuba, I.I. et al. (1998) Clin. Cancer Res. 4:2931-2938). Such a model is
particularly useful for comparing phenotypic and molecular characteristics of human mammary
epithelial cells at various stages of malignant transformation.

Estrogen stimulation plays a critical role in the development of normal mammary epithelium.
Estradiol has a direct mitogenic effect on breast cancer cells, causing them to divide more rapidly by
shortening their cell cycle. Also, estradiol induces a large number of enzymes and other proteins
involved in nucleic acid synthesis in isolated breast cancer cell lines. Estradiol may increase the
expression of the EGF receptor in response to TGF-α and EGF. In addition, estrogens may promote
proliferation of tumor cells by inducing the synthesis of TGF-α and EGF, and may block growth
factors that would normally inhibit tumor cell growth. Estrogen receptor (ER) has been investigated
extensively as a prognostic marker in breast cancer. Patients whose tumors display high levels of
estrogen receptor have a significantly better prognosis than patients with receptor-negative tumors.
When ER is lost or cells expressing the ER are selected against by therapeutic treatments, the tumor
becomes more aggressive.
Lung cancer 

Lung cancer is the leading cause of cancer death for men and the second leading cause of
cancer death for women in the U.S. Lung cancers are divided into four histopathologically distinct
groups. Three groups (squamous cell carcinoma, adenocarcinoma, and large cell carcinoma) are
classified as non-small cell lung cancers (NSCLCs). The fourth group of cancers is referred to as
small cell lung cancer (SCLC). Deletions on chromosome 3 are common in this disease and are
thought to indicate the presence of a tumor suppressor gene in this region. Activating mutations in
K-ras are commonly found in lung cancer and are the basis of one of the mouse models for the disease.

Ovarian Cancer

Ovarian cancer is the leading cause of death from a gynecologic cancer. The majority of 
ovarian cancers are derived from epithelial cells, and 70% of patients with epithelial ovarian cancers 
present with late-stage disease. As a result, the long-term survival rate for this disease is very low.
Identification of early-stage markers for ovarian cancer would significantly increase the survival rate. 

The molecular events that lead to ovarian cancer are poorly understood. Some of the known
aberrations include mutation of p53 and microsatellite instability. Since gene expression patterns are
likely to vary when normal ovary is compared to ovarian tumors, examination of gene expression in
these tissues to identify possible markers for ovarian cancer is particularly relevant to improving
diagnosis, prognosis, and treatment of this disease.

Colon Cancer

Colorectal cancer is the second leading cause of cancer deaths in the United States. Colon
cancer is associated with aging, since 90% of the total cases occur in individuals over the age of 55.
A widely accepted hypothesis is that several contributing genetic mutations must accumulate over time
in an individual who develops the disease. To understand the nature of genetic alterations in colorectal
cancer, a number of studies have focused on the inherited syndromes. The first known inherited
syndrome, Familial Adenomatous Polyposis (FAP), is caused by mutations in the Adenomatous
Polyposis Coli gene (APC), resulting in truncated or inactive forms of the protein. This tumor
suppressor gene has been mapped to chromosome 5q. The second known inherited syndrome is
hereditary nonpolyposis colorectal cancer (HNPCC), which is caused by mutations in mismatch repair
genes.

Although hereditary colon cancer syndromes occur in a small percentage of the population
and most colorectal cancers are considered sporadic, knowledge from studies of the hereditary
syndromes can be generally applied. For instance, somatic mutations in APC occur in at least 80% of
indiscriminate colon tumors. APC mutations are thought to be the initiating event in the disease.
Other mutations occur subsequently. Approximately 50% of colorectal cancers contain activating
mutations in ras, while 85% contain inactivating mutations in p53. Changes in these genes lead to
gene expression changes in colon cancer. Less is understood about downstream targets of these
mutations and the role they may play in cancer development and progression.

Inflammatory/Immune Response 

Blood vessel walls are composed of two tissue layers: an endothelial cell (EC) layer which
comprises the lumenal surface of the vessel, and an underlying vascular smooth muscle cell (VSMC)
layer. Through dynamic interactions with each other and with surrounding tissues, the vascular
endothelium and smooth muscle tissues maintain vascular tone, control selective permeability of the
vascular wall, direct vessel remodeling and angiogenesis, and modulate inflammatory and immune
responses.

The inflammatory response is a complex vascular reaction mediated by numerous cytokines,
chemokines, growth factors, and other signaling molecules expressed by activated ECs, VSMCs and
leukocytes. Inflammation protects the organism during trauma and infection, but can also lead to
pathological conditions such as atherosclerosis. The pro-inflammatory cytokines, interleukin (IL)-1
and tumor necrosis factor (TNF), are secreted by a small number of activated macrophages or other
cells and can set off a cascade of vascular changes, largely through their ability to alter gene
expression patterns in ECs and VSMCs. These vascular changes include vasodilation and increased
permeability of microvasculature, edema, and leukocyte extravasation and transmigration across the
vessel wall. Ultimately, leukocytes, particularly neutrophils and monocytes/macrophages, accumulate
in the extravascular space, where they remove injurious agents by phagocytosis and oxidative killing, a
process accompanied by release of toxic factors, such as proteases and reactive oxygen species.

Tumor necrosis factor-α (TNF-α) is a pleiotropic factor that exerts a variety of effects, such
as growth promotion, growth inhibition, angiogenesis, cytotoxicity, inflammation, and
immunomodulation. This cytokine is synthesized mainly by macrophages in response to invasive
stimuli as an active 26 kDa membrane-bound precursor that is cleaved proteolytically to a mature 17
kDa form with the prosequence polypeptide remaining associated with the membrane. The peptide is
bioactive as a 51 kDa trimer, which can be recognized by TNF-α receptors. TNF-α receptors are
present in the majority of cell types. IL-1 and TNF induce pro-inflammatory, thrombotic, and
anti-apoptotic changes in gene expression by signaling through receptors on the surface of ECs and
VSMCs; these receptors activate transcription factors such as NF-κB as well as AP-1, IRF-1, and
NF-GMa, leading to alterations in gene expression. Genes known to be differentially regulated in EC
by IL-1 and TNF include E selectin, VCAM-1, ICAM-1, PAF, IκBα, IAP-1, MCP-1, eotaxin,
ENA-78, G-CSF, A20, ICE, and complement C3 component. A key event in inflammation, adhesion and
transmigration of blood leukocytes across the vascular endothelium, for example, is mediated by
increased expression of E selectin, P selectin, ICAM-1, and VCAM-1 on activated endothelium.

Several investigators have examined changes in vascular cell gene expression associated with
various inflammatory diseases or model systems. Examining human umbilical vein endothelial cells
(HUVEC) activated by recombinant TNFα or conditioned medium from activated human primary
monocytes, Horrevoets et al. (1999; Blood 93:3418-3431) identified 106 differentially regulated genes.
In a similar approach, de Vries et al. (2000; JBC 275:23939-23947) identified 40 differentially regulated
genes in umbilical cord artery-derived smooth muscle cells activated by conditioned media from
cultured macrophages after stimulation with oxidized LDL particles. In both studies, many of the
identified genes were already known to be involved in inflammation. Comparing expression profiles
from inflammatory diseased tissues, cultured macrophages, chondrocyte cell lines, primary
chondrocytes, and synoviocytes, Heller et al. (1997; Proc Natl Acad Sci USA 94:2150-2155) identified
candidate genes involved in inflammatory responses, including TNF, IL-1, IL-6, IL-8, G-CSF,
RANTES, and V-CAM. From this candidate gene set, tissue inhibitor of metalloproteinase 1, ferritin
light chain, and manganese superoxide dismutase were found to be differentially expressed in
rheumatoid arthritis (RA) relative to inflammatory bowel disease (IBD). Further, IL-3, chemokine
Groα, and metalloproteinase matrix metallo-elastase were expressed in both RA and IBD. Most
recently, in an analysis of cultured aortic smooth muscle cells treated with TNFα, Haley et al. (2000;
Circulation 102:2185-2189) found a 20-fold increase in eotaxin, an eosinophil chemotactic factor. The
overexpression of eotaxin and its receptor CCR3 in atherosclerotic lesions was confirmed by northern
analysis.

Array technology can provide a simple way to explore the expression of a single polymorphic
gene or the expression profile of a large number of related or unrelated genes. When the expression
of a single gene is examined, arrays are employed to detect the expression of a specific gene or its
variants. When an expression profile is examined, arrays provide a platform for identifying genes that
are tissue specific, carry out housekeeping functions, are part of a signaling cascade, or are
specifically related to a particular genetic predisposition, condition, disease, or disorder. The potential
application of gene expression profiling is particularly relevant to improving diagnosis, prognosis, and
treatment of disease. For example, both the levels and sequences expressed in tissues from subjects
with coronary artery disease may be compared with the levels and sequences expressed in normal
vascular tissue.

DNA-based array technology is also useful as a method of analyzing cell signaling pathways.
For example, protein kinase C (PKC) is a family of serine/threonine kinases which play a critical role
in many signal transduction pathways in the cell (Kanashiro, C.A. and Khalil, R.A. (1998) Clin. Exp.
Pharmacol. Physiol. 25:974-985). Phorbol 12-myristate 13-acetate (PMA) is a broad activator of the
protein kinase C-dependent pathways. PMA promotes tumors in cells by overactivating PKC
pathways. PKC is also affected by intracellular calcium levels. Internal calcium flux has been shown
to have many effects on cells, particularly in the areas of cell activation and proliferation (Cole, K. and
Kohn, E. (1994) Cancer Metastasis Rev. 13:31-44). Ionomycin is a calcium ionophore that permits the
entry of calcium into the cell, hence increasing the cytosolic calcium concentration. Thus the
combination of PMA and ionomycin activates two of the major signaling pathways used by
mammalian cells to interact with their environment. In T cells, for example, the combination of PMA
and ionomycin mimics the type of secondary signaling events elicited during optimal T cell activation,
while in endothelial cells, PMA and ionomycin reproduce the effect of inflammatory mediators.

Cell lines are widely used in experimental biology to model human cell behavior. Jurkat, an
acute T-cell leukemia cell line that grows actively in the absence of external stimuli, is used to study
signaling in human T cells. ECV304, a cell line derived from the endothelium of the human umbilical
vein, is used to study the functional biology of human endothelial cells.
Neurological disorders

Characterization of region-specific gene expression in the human brain provides a context and
background for molecular neurobiology on a variety of neurological disorders. For example,
Alzheimer's disease (AD) is a progressive, neurodestructive process of the human neocortex,
characterized by the deterioration of memory and higher cognitive function. A progressive and
irreversible brain disorder, AD is characterized by three major pathogenic episodes involving (a) an
aberrant processing and deposition of beta-amyloid precursor protein (betaAPP) to form neurotoxic
beta-amyloid (betaA) peptides and an aggregated insoluble polymer of betaA that forms the senile
plaque, (b) the establishment of intraneuronal neuritic tau pathology yielding widespread deposits of
argyrophilic neurofibrillary tangles (NFT) and (c) the initiation and proliferation of a brain-specific
inflammatory response. These three seemingly disparate attributes of AD etiopathogenesis are linked
by the fact that proinflammatory microglia, reactive astrocytes and their associated cytokines and
chemokines are associated with the biology of the microtubule associated protein tau, betaA speciation
and aggregation. Missense mutations in the presenilin genes PS1 and PS2, implicated in early onset
familial AD, cause abnormal betaAPP processing with resultant overproduction of betaA42 and
related neurotoxic peptides. Specific betaA fragments such as betaA42 can further potentiate
proinflammatory mechanisms. Expression of the inducible oxidoreductase cyclooxygenase-2 and
cytosolic phospholipase A2 (cPLA2) is strongly activated during cerebral ischemia and trauma,
epilepsy and AD, indicating the induction of proinflammatory gene pathways as a response to brain
injury. Neurotoxic metals such as aluminum and zinc, both implicated in AD etiopathogenesis, and
arachidonic acid, a major metabolite of brain cPLA2 activity, each polymerize hyperphosphorylated
tau to form NFT-like bundles. Studies have identified a reduced risk for AD in patients aged over 70
years who were previously treated with non-steroidal anti-inflammatory drugs for non-CNS afflictions
that include arthritis. (For a review of the interrelationships between the mechanisms of PS1, PS2 and
betaAPP gene expression, tau and betaA deposition and the induction, regulation and proliferation in
AD of the neuroinflammatory response, see Lukiw, W.J. and Bazan, N.G. (2000) Neurochem. Res.
25:1173-1184).

The discovery of new nucleic acid-associated proteins, and the polynucleotides encoding them, 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention, 
and treatment of cell proliferative, neurological, developmental, and autoimmune/inflammatory 
disorders, and infections, and in the assessment of the effects of exogenous compounds on the
expression of nucleic acid and amino acid sequences of nucleic acid-associated proteins.

There is a need in the art for new compositions, including nucleic acids and proteins, for the 
diagnosis, prevention, and treatment of cell proliferative, neurological, developmental, and 
autoimmune/inflammatory disorders, and infections. 



SUMMARY OF THE INVENTION

Various embodiments of the invention provide purified polypeptides, nucleic acid-associated
proteins, referred to collectively as 'NAAP' and individually as 'NAAP-1,' 'NAAP-2,' 'NAAP-3,'
'NAAP-4,' 'NAAP-5,' 'NAAP-6,' 'NAAP-7,' 'NAAP-8,' 'NAAP-9,' 'NAAP-10,' 'NAAP-11,'
'NAAP-12,' 'NAAP-13,' 'NAAP-14,' 'NAAP-15,' 'NAAP-16,' 'NAAP-17,' 'NAAP-18,' 'NAAP-19,'
'NAAP-20,' 'NAAP-21,' 'NAAP-22,' 'NAAP-23,' 'NAAP-24,' 'NAAP-25,' 'NAAP-26,' 'NAAP-27,'
'NAAP-28,' 'NAAP-29,' 'NAAP-30,' 'NAAP-31,' 'NAAP-32,' 'NAAP-33,' 'NAAP-34,' 'NAAP-35,'
'NAAP-36,' 'NAAP-37,' 'NAAP-38,' 'NAAP-39,' 'NAAP-40,' 'NAAP-41,' 'NAAP-42,' 'NAAP-43,'
'NAAP-44,' 'NAAP-45,' 'NAAP-46,' 'NAAP-47,' 'NAAP-48,' 'NAAP-49,' 'NAAP-50,' 'NAAP-51,'
'NAAP-52,' 'NAAP-53,' 'NAAP-54,' 'NAAP-55,' 'NAAP-56,' and 'NAAP-57,'
and methods for using these proteins and their encoding polynucleotides for the detection,
diagnosis, and treatment of diseases and medical conditions. Embodiments also provide methods for
utilizing the purified nucleic acid-associated proteins and/or their encoding polynucleotides for
facilitating the drug discovery process, including determination of efficacy, dosage, toxicity, and
pharmacology. Related embodiments provide methods for utilizing the purified nucleic acid-associated
proteins and/or their encoding polynucleotides for investigating the pathogenesis of diseases and
medical conditions.

An embodiment provides an isolated polypeptide selected from the group consisting of a) a
polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NO:1-57,
b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical or at
least about 90% identical to an amino acid sequence selected from the group consisting of SEQ ID
NO:1-57, c) a biologically active fragment of a polypeptide having an amino acid sequence selected
from the group consisting of SEQ ID NO:1-57, and d) an immunogenic fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-57. Another
embodiment provides an isolated polypeptide comprising an amino acid sequence of SEQ ID NO:1-57.

Still another embodiment provides an isolated polynucleotide encoding a polypeptide selected
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the
group consisting of SEQ ID NO:1-57, b) a polypeptide comprising a naturally occurring amino acid
sequence at least 90% identical or at least about 90% identical to an amino acid sequence selected
from the group consisting of SEQ ID NO:1-57, c) a biologically active fragment of a polypeptide
having an amino acid sequence selected from the group consisting of SEQ ID NO:1-57, and d) an
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-57. In another embodiment, the polynucleotide encodes a polypeptide
selected from the group consisting of SEQ ID NO:1-57. In an alternative embodiment, the
polynucleotide is selected from the group consisting of SEQ ID NO:58-114.

Still another embodiment provides a recombinant polynucleotide comprising a promoter
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting
of SEQ ID NO:1-57, b) a polypeptide comprising a naturally occurring amino acid sequence at least
90% identical or at least about 90% identical to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-57, c) a biologically active fragment of a polypeptide having an amino acid
sequence selected from the group consisting of SEQ ID NO:1-57, and d) an immunogenic fragment of
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-57.
Another embodiment provides a cell transformed with the recombinant polynucleotide. Yet another
embodiment provides a transgenic organism comprising the recombinant polynucleotide.

Another embodiment provides a method for producing a polypeptide selected from the group
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting
of SEQ ID NO:1-57, b) a polypeptide comprising a naturally occurring amino acid sequence at least
90% identical or at least about 90% identical to an amino acid sequence selected from the group
consisting of SEQ ID NO:1-57, c) a biologically active fragment of a polypeptide having an amino acid
sequence selected from the group consisting of SEQ ID NO:1-57, and d) an immunogenic fragment of
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-57.
The method comprises a) culturing a cell under conditions suitable for expression of the polypeptide,
wherein said cell is transformed with a recombinant polynucleotide comprising a promoter sequence
operably linked to a polynucleotide encoding the polypeptide, and b) recovering the polypeptide so
expressed.

Yet another embodiment provides an isolated antibody which specifically binds to a
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NO:1-57, b) a polypeptide comprising a naturally
occurring amino acid sequence at least 90% identical or at least about 90% identical to an amino acid
sequence selected from the group consisting of SEQ ID NO:1-57, c) a biologically active fragment of
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-57,
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the
group consisting of SEQ ID NO:1-57.

Still yet another embodiment provides an isolated polynucleotide selected from the group
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group
consisting of SEQ ID NO:58-114, b) a polynucleotide comprising a naturally occurring polynucleotide
sequence at least 90% identical or at least about 90% identical to a polynucleotide sequence selected
from the group consisting of SEQ ID NO:58-114, c) a polynucleotide complementary to the
polynucleotide of a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA
equivalent of a)-d). In other embodiments, the polynucleotide can comprise at least about 20, 30, 40,
60, 80, or 100 contiguous nucleotides.

Yet another embodiment provides a method for detecting a target polynucleotide in a sample,
said target polynucleotide being selected from the group consisting of a) a polynucleotide comprising a
polynucleotide sequence selected from the group consisting of SEQ ID NO:58-114, b) a polynucleotide
comprising a naturally occurring polynucleotide sequence at least 90% identical or at least about 90%
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:58-114, c) a
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the
polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the
sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence
complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to
said target polynucleotide, under conditions whereby a hybridization complex is formed between said
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of
said hybridization complex. In a related embodiment, the method can include detecting the amount of
the hybridization complex. In still other embodiments, the probe can comprise at least about 20, 30,
40, 60, 80, or 100 contiguous nucleotides.
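
The probe requirement recited above (at least 20 contiguous nucleotides complementary to the target) can be illustrated with a minimal Python sketch. The function names and the exact-match test are assumptions made for illustration only; actual hybridization depends on stringency conditions and may tolerate mismatches, which this sketch does not model.

```python
# Illustrative sketch only: exact complementarity is used as a simplified
# stand-in for specific hybridization. All names here are hypothetical.

_PAIRS = str.maketrans("ACGT", "TGCA")  # Watson-Crick DNA base pairs


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5'->3'."""
    return seq.upper().translate(_PAIRS)[::-1]


def probe_matches_target(probe: str, target: str, min_len: int = 20) -> bool:
    """True if the probe has at least min_len nucleotides and is exactly
    complementary to a contiguous stretch of the target (both given 5'->3')."""
    probe, target = probe.upper(), target.upper()
    if len(probe) < min_len:
        return False
    return reverse_complement(probe) in target
```

A probe built as `reverse_complement(target[5:25])` passes this check against `target`, whereas a 19-nucleotide probe fails the length requirement.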

Still yet another embodiment provides a method for detecting a target polynucleotide in a
sample, said target polynucleotide being selected from the group consisting of a) a polynucleotide
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:58-114, b) a
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID
NO:58-114, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain
reaction amplification, and b) detecting the presence or absence of said amplified target polynucleotide
or fragment thereof. In a related embodiment, the method can include detecting the amount of the
amplified target polynucleotide or fragment thereof.

Another embodiment provides a composition comprising an effective amount of a polypeptide
selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NO:1-57, b) a polypeptide comprising a naturally occurring
amino acid sequence at least 90% identical or at least about 90% identical to an amino acid sequence
selected from the group consisting of SEQ ID NO:1-57, c) a biologically active fragment of a
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:1-57,
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the
group consisting of SEQ ID NO:1-57, and a pharmaceutically acceptable excipient. In one
embodiment, the composition can comprise an amino acid sequence selected from the group consisting
of SEQ ID NO:1-57. Other embodiments provide a method of treating a disease or condition
associated with decreased or abnormal expression of functional NAAP, comprising administering to a
patient in need of such treatment the composition.

Yet another embodiment provides a method for screening a compound for effectiveness as an
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-57, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an
amino acid sequence selected from the group consisting of SEQ ID NO:1-57, c) a biologically active
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ
ID NO:1-57, and d) an immunogenic fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-57. The method comprises a) exposing a sample
comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. Another
embodiment provides a composition comprising an agonist compound identified by the method and a
pharmaceutically acceptable excipient. Yet another embodiment provides a method of treating a
disease or condition associated with decreased expression of functional NAAP, comprising
administering to a patient in need of such treatment the composition.

Still yet another embodiment provides a method for screening a compound for effectiveness
as an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an
amino acid sequence selected from the group consisting of SEQ ID NO:1-57, b) a polypeptide
comprising a naturally occurring amino acid sequence at least 90% identical or at least about 90%
identical to an amino acid sequence selected from the group consisting of SEQ ID NO:1-57, c) a
biologically active fragment of a polypeptide having an amino acid sequence selected from the group
consisting of SEQ ID NO:1-57, and d) an immunogenic fragment of a polypeptide having an amino
acid sequence selected from the group consisting of SEQ ID NO:1-57. The method comprises a)
exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in
the sample. Another embodiment provides a composition comprising an antagonist compound
identified by the method and a pharmaceutically acceptable excipient. Yet another embodiment
provides a method of treating a disease or condition associated with overexpression of functional
NAAP, comprising administering to a patient in need of such treatment the composition.

Another embodiment provides a method of screening for a compound that specifically binds to
a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO:1-57, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an
amino acid sequence selected from the group consisting of SEQ ID NO:1-57, c) a biologically active
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ
ID NO:1-57, and d) an immunogenic fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-57. The method comprises a) combining the
polypeptide with at least one test compound under suitable conditions, and b) detecting binding of the
polypeptide to the test compound, thereby identifying a compound that specifically binds to the
polypeptide.

Yet another embodiment provides a method of screening for a compound that modulates the
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino
acid sequence selected from the group consisting of SEQ ID NO:1-57, b) a polypeptide comprising a
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an
amino acid sequence selected from the group consisting of SEQ ID NO:1-57, c) a biologically active
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ
ID NO:1-57, and d) an immunogenic fragment of a polypeptide having an amino acid sequence
selected from the group consisting of SEQ ID NO:1-57. The method comprises a) combining the
polypeptide with at least one test compound under conditions permissive for the activity of the
polypeptide, b) assessing the activity of the polypeptide in the presence of the test compound, and c)
comparing the activity of the polypeptide in the presence of the test compound with the activity of the
polypeptide in the absence of the test compound, wherein a change in the activity of the polypeptide in
the presence of the test compound is indicative of a compound that modulates the activity of the
polypeptide.

Still yet another embodiment provides a method for screening a compound for effectiveness in
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a
polynucleotide sequence selected from the group consisting of SEQ ID NO:58-114, the method
comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting
altered expression of the target polynucleotide, and c) comparing the expression of the target
polynucleotide in the presence of varying amounts of the compound and in the absence of the
compound.

Another embodiment provides a method for assessing toxicity of a test compound, said
method comprising a) treating a biological sample containing nucleic acids with the test compound; b)
hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:58-114, ii) a
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID
NO:58-114, iii) a polynucleotide having a sequence complementary to i), iv) a polynucleotide
complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs
under conditions whereby a specific hybridization complex is formed between said probe and a target
polynucleotide in the biological sample, said target polynucleotide selected from the group consisting of
i) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ
ID NO:58-114, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least
90% identical or at least about 90% identical to a polynucleotide sequence selected from the group
consisting of SEQ ID NO:58-114, iii) a polynucleotide complementary to the polynucleotide of i), iv) a
polynucleotide complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv).
Alternatively, the target polynucleotide can comprise a fragment of a polynucleotide selected from the
group consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) comparing
the amount of hybridization complex in the treated biological sample with the amount of hybridization
complex in an untreated biological sample, wherein a difference in the amount of hybridization
complex in the treated biological sample is indicative of toxicity of the test compound.
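
Steps c) and d) of the toxicity method compare the amount of hybridization complex in treated and untreated samples. The Python sketch below shows one way such a comparison might be expressed. The function names, the log2-ratio measure, and the two-fold cutoff are all assumptions introduced for illustration; the text above does not specify how large a difference must be.

```python
import math

# Illustrative sketch only. The two-fold default cutoff is an assumed value,
# not one stated in the specification.


def log2_ratio(treated: float, untreated: float) -> float:
    """Log2 ratio of hybridization signal, treated versus untreated."""
    if treated <= 0 or untreated <= 0:
        raise ValueError("signal amounts must be positive")
    return math.log2(treated / untreated)


def flags_difference(treated: float, untreated: float, min_fold: float = 2.0) -> bool:
    """True if the signals differ by at least min_fold in either direction."""
    return abs(log2_ratio(treated, untreated)) >= math.log2(min_fold)
```

Under these assumptions, a treated signal of 400 against an untreated signal of 100 is flagged as a difference, while 120 against 100 is not.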

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for full length polynucleotide and polypeptide
embodiments of the invention.

Table 2 shows the GenBank identification number and annotation of the nearest GenBank
homolog, and the PROTEOME database identification numbers and annotations of PROTEOME
database homologs, for polypeptide embodiments of the invention. The probability scores for the
matches between each polypeptide and its homolog(s) are also shown.

Table 3 shows structural features of polypeptide embodiments, including predicted motifs and
domains, along with the methods, algorithms, and searchable databases used for analysis of the
polypeptides.

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble
polynucleotide embodiments, along with selected fragments of the polynucleotides.

Table 5 shows representative cDNA libraries for polynucleotide embodiments.

Table 6 provides an appendix which describes the tissues and vectors used for construction of
the cDNA libraries shown in Table 5.

Table 7 shows the tools, programs, and algorithms used to analyze polynucleotides and
polypeptides, along with applicable descriptions, references, and threshold parameters.

Table 8 shows single nucleotide polymorphisms found in polynucleotide sequences of the 
invention, along with allele frequencies in different human populations. 

DESCRIPTION OF THE INVENTION

Before the present proteins, nucleic acids, and methods are described, it is understood that 
embodiments of the invention are not limited to the particular machines, instruments, materials, and 
methods described, as these may vary. It is also to be understood that the terminology used herein is 
for the purpose of describing particular embodiments only, and is not intended to limit the scope of the 
invention.

As used herein and in the appended claims, the singular forms "a," "an," and "the" include 
plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a 
host cell" includes a plurality of such host cells, and a reference to "an antibody" is a reference to one 
or more antibodies and equivalents thereof known to those skilled in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs.
Although any machines, materials, and methods similar or equivalent to those described herein can be
used to practice or test the present invention, the preferred machines, materials and methods are now
described. All publications mentioned herein are cited for the purpose of describing and disclosing the
cell lines, protocols, reagents and vectors which are reported in the publications and which might be
used in connection with various embodiments of the invention. Nothing herein is to be construed as an
admission that the invention is not entitled to antedate such disclosure by virtue of prior invention.
DEFINITIONS

"NAAP" refers to the amino acid sequences of substantially purified NAAP obtained from
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant.

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of
NAAP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other
compound or composition which modulates the activity of NAAP either by directly interacting with
NAAP or by acting on components of the biological pathway in which NAAP participates.

An "allelic variant" is an alternative form of the gene encoding NAAP. Allelic variants may
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or
many allelic variants of its naturally occurring form. Common mutational changes which give rise to
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides.
Each of these types of changes may occur alone, or in combination with the others, one or more times
in a given sequence.

"Altered" nucleic acid sequences encoding NAAP include those sequences with deletions,
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as NAAP or a
polypeptide with at least one functional characteristic of NAAP. Included within this definition are
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of
the polynucleotide encoding NAAP, and improper or unexpected hybridization to allelic variants, with a
locus other than the normal chromosomal locus for the polynucleotide encoding NAAP. The encoded
protein may also be "altered," and may contain deletions, insertions, or substitutions of amino acid
residues which produce a silent change and result in a functionally equivalent NAAP. Deliberate
amino acid substitutions may be made on the basis of one or more similarities in polarity, charge,
solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the
biological or immunological activity of NAAP is retained. For example, negatively charged amino
acids may include aspartic acid and glutamic acid, and positively charged amino acids may include
lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values
may include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side
chains having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and
alanine; and phenylalanine and tyrosine.
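
The residue groupings listed above can be encoded as a simple lookup, sketched here in Python. The grouping table copies the examples given in the preceding paragraph; the function name and the one-letter amino acid codes are illustrative choices. Membership in a common group only suggests that a substitution may be conservative; whether biological or immunological activity is retained must still be determined experimentally.

```python
# Illustrative sketch only: groups transcribed from the examples in the text,
# using standard one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    {"D", "E"},       # negatively charged: aspartic acid, glutamic acid
    {"K", "R"},       # positively charged: lysine, arginine
    {"N", "Q"},       # uncharged polar: asparagine, glutamine
    {"S", "T"},       # uncharged polar: serine, threonine
    {"L", "I", "V"},  # uncharged: leucine, isoleucine, valine
    {"G", "A"},       # uncharged: glycine, alanine
    {"F", "Y"},       # uncharged: phenylalanine, tyrosine
]


def is_listed_substitution(original: str, replacement: str) -> bool:
    """True if both residues are identical or fall in one listed group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return True
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)
```

For instance, aspartic acid to glutamic acid (D to E) falls within a listed group, whereas aspartic acid to lysine (D to K) does not.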

The terms "amino acid" and "amino acid sequence" can refer to an oligopeptide, a peptide, a
polypeptide, or a protein sequence, or a fragment of any of these, and to naturally occurring or
synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid
sequence to the complete native amino acid sequence associated with the recited protein molecule.

"Amplification" relates to the production of additional copies of a nucleic acid. Amplification
may be carried out using polymerase chain reaction (PCR) technologies or other nucleic acid
amplification technologies well known in the art.

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity
of NAAP. Antagonists may include proteins such as antibodies, anticalins, nucleic acids,
carbohydrates, small molecules, or any other compound or composition which modulates the activity of
NAAP either by directly interacting with NAAP or by acting on components of the biological pathway
in which NAAP participates.

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments
thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant.
Antibodies that bind NAAP polypeptides can be prepared using intact polypeptides or using fragments
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA,
or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used
carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and
keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal.

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that
makes contact with a particular antibody. When a protein or a fragment of a protein is used to
immunize a host animal, numerous regions of the protein may induce the production of antibodies
which bind specifically to antigenic determinants (particular regions or three-dimensional structures on
the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used
to elicit the immune response) for binding to an antibody.

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a
specific molecular target. Aptamers are derived from an in vitro evolutionary process (e.g., SELEX
(Systematic Evolution of Ligands by Exponential Enrichment), described in U.S. Patent No.
5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries.
Aptamer compositions may be double-stranded or single-stranded, and may include
deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. The
nucleotide components of an aptamer may have modified sugar groups (e.g., the 2'-OH group of a
ribonucleotide may be replaced by 2'-F or 2'-NH2), which may improve a desired property, e.g.,
resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules,
e.g., a high molecular weight carrier to slow clearance of the aptamer from the circulatory system.
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a
cross-linker (Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13).

The term "intramer" refers to an aptamer which is expressed in vivo. For example, a
vaccinia virus-based RNA expression system has been used to express specific RNA aptamers at
high levels in the cytoplasm of leukocytes (Blind, M. et al. (1999) Proc. Natl. Acad. Sci. USA
96:3606-3610).

The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left- 
handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 
substrates containing right-handed nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with the "sense"
(coding) strand of a polynucleotide having a specific nucleic acid sequence. Antisense compositions
may include DNA; RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone
linkages such as phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides
having modified sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or
oligonucleotides having modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-
deoxyguanosine. Antisense molecules may be produced by any method including chemical synthesis
or transcription. Once introduced into a cell, the complementary antisense molecule base-pairs with a
naturally occurring nucleic acid sequence produced by the cell to form duplexes which block either
transcription or translation. The designation "negative" or "minus" can refer to the antisense strand,
and the designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule.

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic NAAP, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement, 
3'-TCA-5'. 
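The pairing rule in the preceding definition can be illustrated with a minimal sketch. The names PAIRS, complement, and reverse_complement are illustrative only and are not part of the specification; the sketch assumes uppercase DNA bases without ambiguity codes.

```python
# Watson-Crick pairing for DNA bases (uppercase A, C, G, T only; an assumption).
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}

def complement(seq: str) -> str:
    """Return the base-by-base complement, read in the same left-to-right order."""
    return "".join(PAIRS[base] for base in seq)

def reverse_complement(seq: str) -> str:
    """Return the complementary strand written in the 5' to 3' direction."""
    return complement(seq)[::-1]

# For the input 5'-AGT-3', complement() gives the partner strand read 3' to 5'.
print(complement("AGT"))          # TCA, i.e. 3'-TCA-5'
print(reverse_complement("AGT"))  # ACT, the same strand written 5'-ACT-3'
```

Both outputs describe the same complementary strand; only the direction in which it is written differs.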

A "composition comprising a given polynucleotide" and a "composition comprising a given 
polypeptide" can refer to any composition containing the given polynucleotide or polypeptide. The 
composition may comprise a dry formulation or an aqueous solution. Compositions comprising 
polynucleotides encoding NAAP or fragments of NAAP may be employed as hybridization probes. 
The probes may be stored in freeze-dried form and may be associated with a stabilizing agent such as 
a carbohydrate. In hybridizations, the probe may be deployed in an aqueous solution containing salts 
(e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's 
solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to 
repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been 
assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a computer 
program for fragment assembly, such as the GELVIEW fragment assembly system (Accelrys, 
Burlington MA) or Phrap (University of Washington, Seattle WA). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 

Original Residue    Conservative Substitution 
Ala                 Gly, Ser 
Arg                 His, Lys 
Asn                 Asp, Gln, His 
Asp                 Asn, Glu 
Cys                 Ala, Ser 
Gln                 Asn, Glu, His 
Glu                 Asp, Gln, His 
Gly                 Ala 
His                 Asn, Arg, Gln, Glu 
Ile                 Leu, Val 
Leu                 Ile, Val 
Lys                 Arg, Gln, Glu 
Met                 Leu, Ile 
Phe                 His, Met, Leu, Trp, Tyr 
Ser                 Cys, Thr 
Thr                 Ser, Val 
Trp                 Phe, Tyr 
Tyr                 His, Phe, Trp 
Val                 Ile, Leu, Thr 

Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 
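As an illustration only, the substitution table above can be held as a lookup keyed by the original residue. The names CONSERVATIVE and is_conservative are hypothetical; the entries simply transcribe the table using three-letter residue codes.

```python
# Original residue -> residues regarded as conservative substitutions for it.
CONSERVATIVE = {
    "Ala": {"Gly", "Ser"},
    "Arg": {"His", "Lys"},
    "Asn": {"Asp", "Gln", "His"},
    "Asp": {"Asn", "Glu"},
    "Cys": {"Ala", "Ser"},
    "Gln": {"Asn", "Glu", "His"},
    "Glu": {"Asp", "Gln", "His"},
    "Gly": {"Ala"},
    "His": {"Asn", "Arg", "Gln", "Glu"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"His", "Met", "Leu", "Trp", "Tyr"},
    "Ser": {"Cys", "Thr"},
    "Thr": {"Ser", "Val"},
    "Trp": {"Phe", "Tyr"},
    "Tyr": {"His", "Phe", "Trp"},
    "Val": {"Ile", "Leu", "Thr"},
}

def is_conservative(original: str, replacement: str) -> bool:
    """True if the table lists `replacement` as conservative for `original`."""
    return replacement in CONSERVATIVE.get(original, set())

print(is_conservative("Cys", "Ala"))  # True: Ala is listed for Cys
print(is_conservative("Ala", "Cys"))  # False: Cys is not listed for Ala
```

The lookup is directional because the table is not symmetric; for example, Ala is listed as a substitute for Cys, but Cys is not listed as a substitute for Ala.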

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 
one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

"Differential expression" refers to increased or upregulated; or decreased, downregulated, or 
absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 
exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

A "fragment" is a unique portion of NAAP or a polynucleotide encoding NAAP which can be 
identical in sequence to, but shorter in length than, the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from about 5 to about 1000 contiguous nucleotides or amino acid residues. A 
fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 
5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino 
acid residues in length. Fragments may be preferentially selected from certain regions of a molecule. 
For example, a polypeptide fragment may comprise a certain length of contiguous amino acids 
selected from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a 
certain defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID NO:58-114 can comprise a region of unique polynucleotide sequence 
that specifically identifies SEQ ID NO:58-114, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:58-114 can be employed 
in one or more embodiments of methods of the invention, for example, in hybridization and 
amplification technologies and in analogous methods that distinguish SEQ ID NO:58-114 from related 
polynucleotides. The precise length of a fragment of SEQ ID NO:58-114 and the region of SEQ ID 
NO:58-114 to which the fragment corresponds are routinely determinable by one of ordinary skill in 
the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO:1-57 is encoded by a fragment of SEQ ID NO:58-114. A 
fragment of SEQ ID NO:1-57 can comprise a region of unique amino acid sequence that specifically 
identifies SEQ ID NO:1-57. For example, a fragment of SEQ ID NO:1-57 can be used as an 
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO:1-57. 
The precise length of a fragment of SEQ ID NO:1-57 and the region of SEQ ID NO:1-57 to which 
the fragment corresponds can be determined based on the intended purpose for the fragment using 
one or more analytical methods described herein or otherwise known in the art. 

A "full length" polynucleotide is one containing at least a translation initiation codon (e.g., 
methionine) followed by an open reading frame and a translation termination codon. A "full length" 
polynucleotide sequence encodes a "full length" polypeptide sequence. 
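By way of a hedged illustration, the structural requirements of a "full length" coding region can be checked as sketched below. The function name is_full_length_orf is hypothetical, and the sketch assumes that the input is the coding region alone (no untranslated flanking sequence), that ATG is the initiation codon, and that the standard stop codons apply.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumed)

def is_full_length_orf(cds: str) -> bool:
    """Check for ATG, an uninterrupted reading frame, and a terminal stop codon."""
    cds = cds.upper()
    if len(cds) < 6 or len(cds) % 3 != 0:
        return False
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    has_start = codons[0] == "ATG"
    has_stop = codons[-1] in STOP_CODONS
    # No stop codon may interrupt the frame before the final codon.
    frame_open = not any(codon in STOP_CODONS for codon in codons[1:-1])
    return has_start and has_stop and frame_open

print(is_full_length_orf("ATGGCTTAA"))     # True
print(is_full_length_orf("ATGTAAGCTTAA"))  # False: internal stop codon
```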

"Homology" refers to sequence similarity or, alternatively, sequence identity, between two or 

more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of identical residue matches between at least two polynucleotide sequences aligned 
using a standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, 
gaps in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 
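The following minimal sketch shows how percent identity might be computed once two sequences have been aligned. It is not the CLUSTAL V or BLAST implementation; the function name percent_identity is hypothetical, gaps are assumed to be written as "-", and the denominator is assumed to be the number of alignment columns (programs differ on this convention).

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns in which both sequences have a gap.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        raise ValueError("alignment has no comparable columns")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)

# One inserted gap and one mismatch over nine columns: 7 of 9 columns match.
print(round(percent_identity("ACGT-ACGT", "ACGTTACCT"), 1))  # 77.8
```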

Percent identity between polynucleotide sequences may be determined using one or more 
computer algorithms or programs known in the art or described herein. For example, percent identity 
can be determined using the default parameters of the CLUSTAL V algorithm as incorporated into 
the MEGALIGN version 3.12e sequence alignment program. This program is part of the 
LASERGENE software package, a suite of molecular biological analysis programs (DNASTAR, 
Madison WI). CLUSTAL V is described in Higgins, D.G. and P.M. Sharp (1989; CABIOS 5:151-153) 
and in Higgins, D.G. et al. (1992; CABIOS 8:189-191). For pairwise alignments of 
polynucleotide sequences, the default parameters are set as follows: Ktuple=2, gap penalty=5, 
window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected as the default. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
which can be used is provided by the National Center for Biotechnology Information (NCBI) Basic 
Local Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which 
is available from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 
Reward for match: 1 
Penalty for mismatch: -2 
Open Gap: 5 and Extension Gap: 2 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 11 
Filter: on 
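For illustration, the match, mismatch, and gap values listed above can be combined into a raw score for a given gapped alignment. This sketch is not the BLAST program itself; the function name alignment_score is hypothetical, and it assumes an affine gap cost in which a gap of k positions costs the opening penalty plus k times the extension penalty.

```python
def alignment_score(aligned_a: str, aligned_b: str, reward: int = 1,
                    penalty: int = -2, gap_open: int = 5,
                    gap_extend: int = 2, gap: str = "-") -> int:
    """Raw score of a pre-computed alignment under match/mismatch and affine gap costs."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    score = 0
    open_side = None  # which sequence currently holds an open gap, if any
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            side = "a" if x == gap else "b"
            if side != open_side:
                score -= gap_open   # charged once per gap
                open_side = side
            score -= gap_extend     # charged for every gapped position
        else:
            open_side = None
            score += reward if x == y else penalty
    return score

# Seven matches (+7), one mismatch (-2), one single-position gap (-5 - 2).
print(alignment_score("ACGT-ACGT", "ACGTTACCT"))  # -2
```

The result is a raw score only; statistical measures such as the expect value are outside the scope of this sketch.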

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported 
by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a 
length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 
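The effect of degeneracy can be seen in a short sketch using the standard genetic code. The two 12-nucleotide sequences below are invented for illustration and are not taken from the Sequence Listing; the names CODON_TABLE, translate, and identical_positions are likewise hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate complete codons of an uppercase DNA sequence, reading frame 1."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def identical_positions(a: str, b: str) -> float:
    """Percent of positions that are identical in two equal-length sequences."""
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)

seq_a = "ATGGCTTCTAAA"  # illustrative sequence
seq_b = "ATGGCCAGCAAG"  # same peptide, different codon choices
print(translate(seq_a), translate(seq_b))           # MASK MASK
print(round(identical_positions(seq_a, seq_b), 1))  # 58.3
```

The two sequences share only 7 of 12 nucleotide positions yet encode the same four-residue peptide.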

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of identical residue matches between at least two polypeptide sequences aligned using 
a standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. The 
phrases "percent similarity" and "% similarity," as applied to polypeptide sequences, refer to the 
percentage of residue matches, including identical residue matches and conservative substitutions, 
between at least two polypeptide sequences aligned using a standardized algorithm. In contrast, 
conservative substitutions are not included in the calculation of percent identity between polypeptide 
sequences. 
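The distinction between percent identity and percent similarity can be sketched as follows. The sketch assumes an ungapped alignment of one-letter residue codes, treats the first sequence as the original, and uses a small excerpt of conservative substitutions (Lys, Asp, and Leu rows only) chosen purely for illustration; the names EXCERPT and identity_and_similarity are hypothetical.

```python
# Excerpt of conservative substitutions in one-letter code (illustrative subset).
EXCERPT = {
    "K": {"R", "Q", "E"},  # Lys -> Arg, Gln, Glu
    "D": {"N", "E"},       # Asp -> Asn, Glu
    "L": {"I", "V"},       # Leu -> Ile, Val
}

def identity_and_similarity(original: str, other: str) -> tuple:
    """Return (% identity, % similarity) for two aligned, equal-length peptides."""
    if len(original) != len(other) or not original:
        raise ValueError("peptides must be non-empty and of equal length")
    identical = sum(x == y for x, y in zip(original, other))
    conservative = sum(x != y and y in EXCERPT.get(x, set())
                       for x, y in zip(original, other))
    n = len(original)
    return 100.0 * identical / n, 100.0 * (identical + conservative) / n

# K->R and D->E count toward similarity only; W->L counts toward neither.
print(identity_and_similarity("KAGDW", "RAGEL"))  # (40.0, 80.0)
```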

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap 
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 
Open Gap: 11 and Extension Gap: 1 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, 
for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in 
the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 µg/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic 
strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. and D.W. 
Russell (2001; Molecular Cloning: A Laboratory Manual, 3rd ed., vol. 1-3, Cold Spring Harbor Press, 
Cold Spring Harbor NY, ch. 9). 
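As a rough illustration of the relationship between Tm and wash temperature, the sketch below estimates Tm and derives the wash window of 5°C to 20°C below it. The formula in estimate_tm is one commonly quoted empirical approximation for long DNA duplexes and is an assumption here; it is not necessarily the equation given in the cited manual, and the length-correction constant differs between sources. The function names and the example inputs are hypothetical.

```python
import math

def estimate_tm(length: int, gc_percent: float, sodium_molar: float) -> float:
    """Approximate Tm in deg C for a long DNA duplex (assumed empirical formula)."""
    return (81.5 + 16.6 * math.log10(sodium_molar)
            + 0.41 * gc_percent - 675.0 / length)

def wash_window(tm: float, low_offset: float = 20.0, high_offset: float = 5.0) -> tuple:
    """Return (lowest, highest) wash temperature, i.e. 20 to 5 deg C below Tm."""
    return (tm - low_offset, tm - high_offset)

# Hypothetical 500-nucleotide probe, 50% G+C, about 0.039 M Na+
# (roughly 0.2 x SSC, taking 1 x SSC as about 0.195 M Na+).
tm = estimate_tm(length=500, gc_percent=50.0, sodium_molar=0.039)
low, high = wash_window(tm)
print(f"estimated Tm = {tm:.1f} C; wash between {low:.1f} C and {high:.1f} C")
```

Lower salt or lower G+C content reduces the estimated Tm, which is why a wash at fixed temperature becomes more stringent as the SSC concentration is reduced.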

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 µg/ml. Organic solvent, such as 
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acids by 
virtue of the formation of hydrogen bonds between complementary bases. A hybridization complex 
may be formed in solution (e.g., C0t or R0t analysis) or formed between one nucleic acid present in 
solution and another nucleic acid immobilized on a solid support (e.g., paper, membranes, filters, chips, 
pins or glass slides, or any other appropriate substrate to which cells or their nucleic acids have been 
fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or polynucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of NAAP which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
of NAAP which is useful in any of the antibody production methods disclosed herein or known in the 
art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
polypeptides, antibodies, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, antibody, or 
other chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of NAAP. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
functional, or immunological properties of NAAP. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an NAAP may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary 
by cell type depending on the enzymatic milieu of NAAP. 

"Probe" refers to nucleic acids encoding NAAP, their complements, or fragments thereof, 
which are used to detect identical, allelic or related nucleic acids. Probes are isolated oligonucleotides 
or polynucleotides attached to a detectable label or reporter molecule. Typical labels include 
radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are short nucleic 
acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The primer may then be extended along the target DNA strand by a 
DNA polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic 
acid, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in, for example, Sambrook, 
J. and D.W. Russell (2001; Molecular Cloning: A Laboratory Manual, 3rd ed., vol. 1-3, Cold Spring 
Harbor Press, Cold Spring Harbor NY), Ausubel, F.M. et al. (1999; Short Protocols in Molecular 
Biology, 4th ed., John Wiley & Sons, New York NY), and Innis, M. et al. (1990; PCR Protocols, A 
Guide to Methods and Applications, Academic Press, San Diego CA). PCR primer pairs can be 
derived from a known sequence, for example, by using computer programs intended for that purpose 
such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA). 

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas 
South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase 
sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer 
selection program (available to the public from the Whitehead Institute/MIT Center for Genome 
Research, Cambridge MA) allows the user to input a "mispriming library," in which sequences to 
avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of 
oligonucleotides for microarrays. (The source code for the latter two primer selection programs may 
also be obtained from their respective sources and modified to meet the user's specific needs.) The 
PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource 
Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing 
selection of primers that hybridize to either the most conserved or least conserved regions of aligned 
nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved 
oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments 
identified by any of the above selection methods are useful in hybridization technologies, for example, 
as PCR or sequencing primers, microarray elements, or specific probes to identify fully or partially 
complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are 
not limited to those described above. 

A "recombinant nucleic acid" is a nucleic acid that is not naturally occurring or has a 
sequence that is made by an artificial combination of two or more otherwise separated segments of 
sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly, 
by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering 
techniques such as those described in Sambrook and Russell (supra). The term recombinant includes 
nucleic acids that have been altered solely by addition, substitution, or deletion of a portion of the 
nucleic acid. Frequently, a recombinant nucleic acid may include a nucleic acid sequence operably 
linked to a promoter sequence. Such a recombinant nucleic acid may be part of a vector that is used, 
for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA molecule, is composed of the same linear 
sequence of nucleotides as the reference DNA molecule with the exception that all occurrences of 
the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 
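At the level of the written sequence, the definition above amounts to replacing every T with U, as in this minimal sketch. The function name rna_equivalent is illustrative; the change of sugar from deoxyribose to ribose is chemical and is not represented in the letter sequence.

```python
def rna_equivalent(dna: str) -> str:
    """Return the RNA equivalent of a DNA sequence: same order, thymine -> uracil."""
    return dna.upper().replace("T", "U")

print(rna_equivalent("ATGCTT"))  # AUGCUU
```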

The term "sample" is used in its broadest sense. A sample suspected of containing NAAP, 
nucleic acids encoding NAAP, or fragments thereof may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, 
in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the 
epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A and the 
antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60% 
free, preferably at least about 75% free, and most preferably at least about 90% free from other 
components with which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" or "expression profile" refers to the collective pattern of gene expression 
by a particular cell type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or viral 
infection, electroporation, heat shock, lipofection, and particle bombardment. The term "transformed 
cells" includes stably transformed cells in which the inserted DNA is capable of replication either as 
an autonomously replicating plasmid or as part of the host chromosome, as well as transiently 
transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to animals 
and plants, in which one or more of the cells of the organism contains heterologous nucleic acid
introduced by way of human intervention, such as by transgenic techniques well known in the art. The
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a 
recombinant virus. In another embodiment, the nucleic acid can be introduced by infection with a 
recombinant viral vector, such as a lentiviral vector (Lois, C. et al. (2002) Science 295:868-872). The
term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but rather
is directed to the introduction of a recombinant DNA molecule. The transgenic organisms
contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, plants
and animals. The isolated DNA of the present invention can be introduced into the host by methods
known in the art, for example infection, transfection, transformation or transconjugation. Techniques
for transferring the DNA of the present invention into such organisms are widely known and provided
in references such as Sambrook and Russell (supra).

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater
sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have 
significant identity to a reference molecule, but will generally have a greater or lesser number of 
polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding
polypeptide may possess additional functional domains or lack domains that are present in the
reference molecule. Species variants are polynucleotides that vary from one species to another. The
resulting polypeptides will generally have significant amino acid identity relative to each other. A
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between
individuals of a given species. Polymorphic variants also may encompass "single nucleotide
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 
propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
at least 40% sequence identity or sequence similarity to the particular polypeptide sequence over a
certain length of one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool
Version 2.0.9 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for 
example, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, 
or at least 99% or greater sequence identity or sequence similarity over a certain defined length of one
of the polypeptides.
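
As a rough illustration of the identity thresholds in the two "variant" definitions, the following Python sketch computes percent identity for two sequences that are assumed to be already aligned, with gaps written as "-". It does not reproduce blastn, blastp, or the "BLAST 2 Sequences" tool; the function names, the choice of alignment columns as the denominator, and the example sequences are assumptions made only for this example.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Assumption: the inputs are pre-aligned, equal-length strings with '-' for gaps.
    A column containing a gap never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def is_variant(aligned_a: str, aligned_b: str, threshold: float = 40.0) -> bool:
    """True when identity over the aligned length meets the threshold (40% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from the Sequence Listing.
    print(percent_identity("ACGT", "ACGA"))
    print(is_variant("ACGT", "TTTT"))
```

A stricter embodiment can be checked by passing a higher threshold, for example `is_variant(a, b, threshold=95.0)`.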

THE INVENTION 

Various embodiments of the invention include new human nucleic acid-associated proteins
(NAAP), the polynucleotides encoding NAAP, and the use of these compositions for the diagnosis, 
treatment, or prevention of cell proliferative, neurological, developmental, and 
autoimmune/inflammatory disorders, and infections. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide
embodiments of the invention. Each polynucleotide and its corresponding polypeptide are correlated to
a single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is 
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an 
Incyte polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide
sequence is denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ ID
NO:) and an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown. 
Column 6 shows the Incyte ID numbers of physical, full length clones corresponding to the polypeptide 
and polynucleotide sequences of the invention. The full length clones encode polypeptides which have 
at least 95% sequence identity to the polypeptide sequences shown in column 3. 

Table 2 shows sequences with homology to polypeptide embodiments of the invention as
identified by BLAST analysis against the GenBank protein (genpept) database and the PROTEOME
database. Columns 1 and 2 show the polypeptide sequence identification number (Polypeptide SEQ 
ID NO:) and the corresponding Incyte polypeptide sequence number (Incyte Polypeptide ID) for 
polypeptides of the invention. Column 3 shows the GenBank identification number (GenBank ID NO:)
of the nearest GenBank homolog and the PROTEOME database identification numbers
(PROTEOME ID NO:) of the nearest PROTEOME database homologs. Column 4 shows the
probability scores for the matches between each polypeptide and its homolog(s). Column 5 shows the 
annotation of the GenBank and PROTEOME database homolog(s) along with relevant citations where 
applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 and
2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding Incyte
polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. Column
3 shows the number of amino acid residues in each polypeptide. Column 4 shows potential
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the MOTIFS
program of the GCG sequence analysis software package (Accelrys, Burlington MA). Column 6
shows amino acid residues comprising signature sequences, domains, and motifs. Column 7 shows
analytical methods for protein structure/function analysis and in some cases, searchable databases to
which the analytical methods were applied.

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are nucleic acid-associated proteins. For example, 
SEQ ID NO:11 is 100% identical, from residue M1 to residue L174 and from residue Q175 to residue
Q488, to a human protein similar to interferon regulatory factor 5 (GenBank ID g13278720) as
determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST
probability score is 5.8e-273, which indicates the probability of obtaining the observed polypeptide
sequence alignment by chance. SEQ ID NO:11 also has homology to human and mouse interferon
regulatory factor 5, a transcription factor that induces expression of members of the interferon A
family in response to NDV viral infection, as determined by BLAST analysis using the PROTEOME
database. SEQ ID NO:11 also contains an interferon regulatory factor transcription factor domain as
determined by searching for statistically significant matches in the hidden Markov model (HMM)-
based PFAM database of conserved protein family domains. (See Table 3.) Data from BLIMPS,
MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ ID NO:11
is a member of the interferon regulatory factor (IRF) family of transcription factors.

In an alternative example, SEQ ID NO:24 is 96% identical, from residue M1 to residue R133,
to human hMBF1 alpha (GenBank ID g6526355) as determined by the Basic Local Alignment Search
Tool (BLAST). (See Table 2.) The BLAST probability score is 5.0e-63, which indicates the
probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO:24 also
has homology to endothelial differentiation-related factor-1, a putative transcriptional coactivator that
binds to calmodulin (CALM1) in a calcium-dependent manner and to the TATA-binding protein
(TBP), and to proteins that have transcription factor function and are EDF1 transcription
coactivators, as determined by
BLAST analysis using the PROTEOME database. SEQ ID NO:24 also contains a helix-turn-helix
domain as determined by searching for statistically significant matches in the hidden Markov model
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from
additional BLAST analyses provide further corroborative evidence that SEQ ID NO:24 is a
transcription factor regulator.

In an alternative example, SEQ ID NO:35 is 84% identical, from residue M932 to residue
G1638, to human ras-responsive element binding protein (GenBank ID g1654112) as determined by
the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is
0.0, which indicates the probability of obtaining the observed polypeptide sequence alignment by
chance. SEQ ID NO:35 also has homology to nuclear transcription factors, as determined by BLAST
analysis using the PROTEOME database. SEQ ID NO:35 also contains a zinc finger domain as
determined by searching for statistically significant matches in the hidden Markov model (HMM)-
based PFAM database of conserved protein family domains. (See Table 3.) Data from BLIMPS and
MOTIFS analyses provide further corroborative evidence that SEQ ID NO:35 is a transcription
factor.

In an alternative example, SEQ ID NO:55 is 85% identical, from residue M1 to residue V508,
to rat serine-arginine-rich splicing regulatory protein, SRRP86 (GenBank ID g7158880) as determined
by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score
is 4.8e-225, which indicates the probability of obtaining the observed polypeptide sequence alignment
by chance. SEQ ID NO:55 also has homology to proteins that are localized to the nucleus, function as
RNA-binding proteins, and are serine-arginine-rich proteins, as determined by BLAST analysis using
the PROTEOME database. SEQ ID NO:55 also contains an RNA recognition motif as determined
by searching for statistically significant matches in the hidden Markov model (HMM)-based PFAM
database of conserved protein family domains. (See Table 3.) Data from BLIMPS and other
BLAST analyses provide further corroborative evidence that SEQ ID NO:55 is a serine-arginine-rich
splicing regulatory protein.

SEQ ID NO:1-10, SEQ ID NO:12-23, SEQ ID NO:25-34, SEQ ID NO:36-54 and SEQ ID 
NO:56-57 were analyzed and annotated in a similar manner. The algorithms and parameters for the 
analysis of SEQ ID NO:1-57 are described in Table 7.

As shown in Table 4, the full length polynucleotide embodiments were assembled using cDNA
sequences or coding (exon) sequences derived from genomic DNA, or any combination of these two
types of sequences. Column 1 lists the polynucleotide sequence identification number (Polynucleotide 
SEQ ID NO:), the corresponding Incyte polynucleotide consensus sequence number (Incyte ID) for 
each polynucleotide of the invention, and the length of each polynucleotide sequence in basepairs.
Column 2 shows the nucleotide start (5') and stop (3') positions of the cDNA and/or genomic
sequences used to assemble the full length polynucleotide embodiments, and of fragments of the 
polynucleotides which are useful, for example, in hybridization or amplification technologies that 
identify SEQ ID NO:58-114 or that distinguish between SEQ ID NO:58-114 and related 
polynucleotides. 

The polynucleotide fragments described in Column 2 of Table 4 may refer specifically, for
example, to Incyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA
libraries. Alternatively, the polynucleotide fragments described in column 2 may refer to GenBank
cDNAs or ESTs which contributed to the assembly of the full length polynucleotides. In addition, the
polynucleotide fragments described in column 2 may identify sequences derived from the ENSEMBL
(The Sanger Centre, Cambridge, UK) database (i.e., those sequences including the designation
"ENST"). Alternatively, the polynucleotide fragments described in column 2 may be derived from the
NCBI RefSeq Nucleotide Sequence Records Database (i.e., those sequences including the
designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i.e., those sequences
including the designation "NP"). Alternatively, the polynucleotide fragments described in column 2
may refer to assemblages of both cDNA and Genscan-predicted exons brought together by an "exon
stitching" algorithm. For example, a polynucleotide sequence identified as
FL_XXXXXX_N1_N2_YYYYY_N3_N4 represents a "stitched" sequence in which XXXXXX is the
identification number of the cluster of sequences to which the algorithm was applied, and YYYYY is the
number of the prediction generated by the algorithm, and N1,2,3..., if present, represent specific exons
that may have been manually edited during analysis (See Example V). Alternatively, the
polynucleotide fragments in column 2 may refer to assemblages of exons brought together by an
"exon-stretching" algorithm. For example, a polynucleotide sequence identified as
FLXXXXXX_gAAAAA_gBBBBB_1_N is a "stretched" sequence, with XXXXXX being the Incyte
project identification number, gAAAAA being the GenBank identification number of the human
genomic sequence to which the "exon-stretching" algorithm was applied, gBBBBB being the GenBank
identification number or NCBI RefSeq identification number of the nearest GenBank protein homolog,
and N referring to specific exons (See Example V). In instances where a RefSeq sequence was used
as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier (denoted by "NM,"
"NP" or "NT") may be used in place of the GenBank identifier (i.e., gBBBBB).

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 
genomic DNA sequences, or derived from a combination of sequence analysis methods. The
following Table lists examples of component sequence prefixes and corresponding sequence analysis
methods associated with the prefixes (see Example IV and Example V). 



Prefix            Type of analysis and/or examples of programs
----------------  ------------------------------------------------------------------
GNN, GFG, ENST    Exon prediction from genomic sequences using, for example,
                  GENSCAN (Stanford University, CA, USA) or FGENES
                  (Computer Genomics Group, The Sanger Centre, Cambridge, UK)
GBI               Hand-edited analysis of genomic sequences.
FL                Stitched or stretched genomic sequences (see Example V).
INCY              Full length transcript and exon prediction from mapping of EST
                  sequences to the genome. Genomic location and EST composition
                  data are combined to predict the exons and resulting transcript.
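
The prefix-to-method correspondence in the Table above can be expressed as a small lookup, sketched here in Python. The dictionary contents paraphrase the Table; the function name `classify_component` and the example identifiers are hypothetical and are not taken from Table 4.

```python
from typing import Optional, Tuple

# Prefixes and analysis types paraphrased from the Table of component sequence prefixes.
PREFIX_METHODS = {
    "GNN": "exon prediction from genomic sequences",
    "GFG": "exon prediction from genomic sequences",
    "ENST": "exon prediction from genomic sequences",
    "GBI": "hand-edited analysis of genomic sequences",
    "FL": "stitched or stretched genomic sequences",
    "INCY": "full length transcript and exon prediction from mapping of EST sequences to the genome",
}


def classify_component(identifier: str) -> Optional[Tuple[str, str]]:
    """Return (prefix, analysis type) for a component identifier, or None if no prefix matches.

    Longer prefixes are tried first so that a short prefix cannot shadow a longer one.
    """
    for prefix in sorted(PREFIX_METHODS, key=len, reverse=True):
        if identifier.startswith(prefix):
            return prefix, PREFIX_METHODS[prefix]
    return None


if __name__ == "__main__":
    # Hypothetical identifiers, used only to exercise the lookup.
    for name in ("ENST00000123", "GBI.0001", "XYZ1"):
        print(name, classify_component(name))
```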



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in
Table 4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte
cDNA identification numbers are not shown.

Table 5 shows the representative cDNA libraries for those full length polynucleotides which 
were assembled using Incyte cDNA sequences. The representative cDNA library is the Incyte 
cDNA library which is most frequently represented by the Incyte cDNA sequences which were used 
to assemble and confirm the above polynucleotides. The tissues and vectors which were used to
construct the cDNA libraries shown in Table 5 are described in Table 6.

Table 8 shows single nucleotide polymorphisms (SNPs) found in polynucleotide sequences of 
the invention, along with allele frequencies in different human populations. Columns 1 and 2 show the 
polynucleotide sequence identification number (SEQ ID NO:) and the corresponding Incyte project 
identification number (PID) for polynucleotides of the invention. Column 3 shows the Incyte
identification number for the EST in which the SNP was detected (EST ID), and column 4 shows the
identification number for the SNP (SNP ID). Column 5 shows the position within the EST sequence 
at which the SNP is located (EST SNP), and column 6 shows the position of the SNP within the full- 
length polynucleotide sequence (CB1 SNP). Column 7 shows the allele found in the EST sequence. 
Columns 8 and 9 show the two alleles found at the SNP site. Column 10 shows the amino acid
encoded by the codon including the SNP site, based upon the allele found in the EST. Columns 11-14
show the frequency of allele 1 in four different human populations. An entry of n/d (not detected) 
indicates that the frequency of allele 1 in the population was too low to be detected, while n/a (not 
available) indicates that the allele frequency was not determined for the population. 

The invention also encompasses NAAP variants. Various embodiments of NAAP variants
can have at least about 80%, at least about 90%, or at least about 95% amino acid sequence identity
to the NAAP amino acid sequence, and can contain at least one functional or structural characteristic 
of NAAP. 

Various embodiments also encompass polynucleotides which encode NAAP. In a particular
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected
from the group consisting of SEQ ID NO:58-114, which encodes NAAP. The polynucleotide
sequences of SEQ ID NO:58-114, as presented in the Sequence Listing, embrace the equivalent RNA
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 
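
At the level of sequence text, the equivalence between a DNA sequence and its RNA counterpart reduces to replacing every thymine with uracil; the change from deoxyribose to ribose is chemical and has no representation in the string. A minimal Python sketch follows, in which the function name `dna_to_rna` and the example sequence are illustrative assumptions.

```python
def dna_to_rna(dna: str) -> str:
    """Return the RNA-equivalent text of a DNA sequence (T replaced by U).

    Only the base substitution is modeled; the ribose/deoxyribose difference
    is not something a sequence string can express.
    """
    return dna.upper().replace("T", "U")


if __name__ == "__main__":
    # Hypothetical fragment, not taken from the Sequence Listing.
    print(dna_to_rna("ATGCTTGA"))
```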

The invention also encompasses variants of a polynucleotide encoding NAAP. In particular,
such a variant polynucleotide will have at least about 70%, or alternatively at least about 85%, or even
at least about 95% polynucleotide sequence identity to a polynucleotide encoding NAAP. A particular
aspect of the invention encompasses a variant of a polynucleotide comprising a sequence selected
from the group consisting of SEQ ID NO:58-114 which has at least about 70%, or alternatively at
least about 85%, or even at least about 95% polynucleotide sequence identity to a nucleic acid
sequence selected from the group consisting of SEQ ID NO:58-114. Any one of the polynucleotide
variants described above can encode a polypeptide which contains at least one functional or structural 
characteristic of NAAP. 

In addition, or in the alternative, a polynucleotide variant of the invention is a splice variant of a 
polynucleotide encoding NAAP. A splice variant may have portions which have significant sequence
identity to a polynucleotide encoding NAAP, but will generally have a greater or lesser number of
polynucleotides due to additions or deletions of blocks of sequence arising from alternate splicing of
exons during mRNA processing. A splice variant may have less than about 70%, or alternatively less
than about 60%, or alternatively less than about 50% polynucleotide sequence identity to a
polynucleotide encoding NAAP over its entire length; however, portions of the splice variant will have
at least about 70%, or alternatively at least about 85%, or alternatively at least about 95%, or
alternatively 100% polynucleotide sequence identity to portions of the polynucleotide encoding NAAP.
Any one of the splice variants described above can encode a polypeptide which contains at least one 
functional or structural characteristic of NAAP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the
genetic code, a multitude of polynucleotide sequences encoding NAAP, some bearing minimal
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the
polynucleotide sequence of naturally occurring NAAP, and all such variations are to be considered as
being specifically disclosed. 
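
To make the scale of this degeneracy concrete, the following Python sketch counts, and for short peptides enumerates, the coding sequences permitted by the standard triplet genetic code. The codon table is the standard code written in T, C, A, G order; the function names and the example peptides are assumptions made for illustration and are not NAAP sequences.

```python
from itertools import product
from math import prod

# Standard genetic code; codons ordered with bases T, C, A, G (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Group synonymous codons by the amino acid (or '*' for stop) they encode.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def count_encodings(peptide: str) -> int:
    """Number of distinct coding sequences for a peptide (one-letter amino acid codes)."""
    return prod(len(SYNONYMS[aa]) for aa in peptide)


def all_encodings(peptide: str) -> list:
    """Every coding sequence for a peptide; practical only for very short peptides."""
    return ["".join(codons) for codons in product(*(SYNONYMS[aa] for aa in peptide))]


if __name__ == "__main__":
    print(count_encodings("MLS"))   # Met x Leu x Ser
    print(all_encodings("MF"))
```

Because the count is a product over residues, it grows very quickly with peptide length, which is why the enumeration function is suitable only for toy inputs.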

Although polynucleotides which encode NAAP and its variants are generally capable of
hybridizing to polynucleotides encoding naturally occurring NAAP under appropriately selected
conditions of stringency, it may be advantageous to produce polynucleotides encoding NAAP or its
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring
codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a
particular prokaryotic or eukaryotic host in accordance with the frequency with which particular
codons are utilized by the host. Other reasons for substantially altering the nucleotide sequence
encoding NAAP and its derivatives without altering the encoded amino acid sequences include the 
production of RNA transcripts having more desirable properties, such as a greater half-life, than 
transcripts produced from the naturally occurring sequence. 
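
A minimal Python sketch of codon substitution without change to the encoded amino acid sequence is given below. The preference table passed to `recode` is a made-up placeholder, not measured codon usage for any host; the function names and the example sequence are likewise assumptions for illustration.

```python
from itertools import product

# Standard genetic code; codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence whose length is a multiple of three."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna: str, preferred: dict) -> str:
    """Swap each codon for the host-preferred synonym, leaving the protein unchanged.

    `preferred` maps a one-letter amino acid code to a codon. Amino acids that are
    absent from the mapping keep their original codon.
    """
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("coding sequence length must be a multiple of three")
    out = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        new = preferred.get(aa, codon)
        if CODON_TABLE[new] != aa:
            raise ValueError(f"{new} does not encode {aa}")
        out.append(new)
    return "".join(out)


if __name__ == "__main__":
    # Hypothetical preferences, chosen only to show the mechanics.
    hypothetical_preferences = {"L": "CTG", "K": "AAA", "*": "TAA"}
    original = "ATGTTAAAGTGA"
    optimized = recode(original, hypothetical_preferences)
    print(optimized, translate(original) == translate(optimized))
```

In practice the preference table would be derived from codon frequencies observed in the chosen host, which this sketch does not supply.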

The invention also encompasses production of polynucleotides which encode NAAP and
NAAP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the
synthetic polynucleotide may be inserted into any of the many available expression vectors and cell
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a polynucleotide encoding NAAP or any fragment thereof. 

Embodiments of the invention can also include polynucleotides that are capable of hybridizing
to the claimed polynucleotides, and, in particular, to those having the sequences shown in SEQ ID
NO:58-114 and fragments thereof, under various conditions of stringency (Wahl, G.M. and S.L.
Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-511).
Hybridization conditions, including annealing and wash conditions, are described in "Definitions."

Methods for DNA sequencing are well known in the art and may be used to practice any of
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems), thermostable T7 polymerase (Amersham Biosciences, Piscataway NJ), or combinations 
of polymerases and proofreading exonucleases such as those found in the ELONGASE amplification
system (Invitrogen, Carlsbad CA). Preferably, sequence preparation is automated with machines
such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal cycler 
(MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Applied Biosystems). 
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (Applied 
Biosystems), the MEGABACE 1000 DNA sequencing system (Amersham Biosciences), or other
systems known in the art. The resulting sequences are analyzed using a variety of algorithms which
are well known in the art (Ausubel et al., supra, ch. 7; Meyers, R.A. (1995) Molecular Biology and
Biotechnology, Wiley VCH, New York NY, pp. 856-853).

The nucleic acids encoding NAAP may be extended utilizing a partial nucleotide sequence
and employing various PCR-based methods known in the art to detect upstream sequences, such as 
promoters and regulatory elements. For example, one method which may be employed, restriction-site 
PCR, uses universal and nested primers to amplify unknown sequence from genomic DNA within a
cloning vector (Sarkar, G. (1993) PCR Methods Applic. 2:318-322). Another method, inverse PCR,
uses primers that extend in divergent directions to amplify unknown sequence from a circularized 
template. The template is derived from restriction fragments comprising a known genomic locus and 
surrounding sequences (Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186). A third method, capture 
PCR, involves PCR amplification of DNA fragments adjacent to known sequences in human and
yeast artificial chromosome DNA (Lagerstrom, M. et al. (1991) PCR Methods Applic. 1:111-119). In
this method, multiple restriction enzyme digestions and ligations may be used to insert an engineered
double-stranded sequence into a region of unknown sequence before performing PCR. Other
methods which may be used to retrieve unknown sequences are known in the art (Parker, J.D. et al.
(1991) Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR, nested primers, and
PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic DNA. This procedure
avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based
methods, primers may be designed using commercially available software, such as OLIGO 4.06
primer analysis software (National Biosciences, Plymouth MN) or another appropriate program, to be
about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the
template at temperatures of about 68°C to 72°C.
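
The length and GC-content criteria stated above can be screened with a few lines of Python, sketched below. This is not a substitute for OLIGO 4.06 or any other primer analysis program, and it deliberately does not estimate annealing temperature, since no formula is given in the text. The function names and example primers are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in a primer."""
    if not primer:
        raise ValueError("primer is empty")
    p = primer.upper()
    return (p.count("G") + p.count("C")) / len(p)


def meets_design_criteria(primer: str, min_len: int = 22, max_len: int = 30,
                          min_gc: float = 0.50) -> bool:
    """Check the stated length window (22 to 30 nt) and GC floor (about 50% or more).

    The annealing-temperature criterion (about 68 C to 72 C) is not evaluated here.
    """
    return min_len <= len(primer) <= max_len and gc_fraction(primer) >= min_gc


if __name__ == "__main__":
    # Hypothetical primers, used only to exercise the checks.
    for candidate in ("GCGCGCGCGCGCATATATATAT", "ATATATATATATATATATATAT", "GCGC"):
        print(candidate, meets_design_criteria(candidate))
```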

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence
into 5' non-transcribed regulatory regions.

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide-
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire
process from loading of samples to computer analysis and electronic data display may be computer
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments
which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotides or fragments thereof which encode
NAAP may be cloned in recombinant DNA molecules that direct expression of NAAP, or fragments
or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of the
genetic code, other polynucleotides which encode substantially the same or a functionally equivalent
polypeptide may be produced and used to express NAAP.

The polynucleotides of the invention can be engineered using methods generally known in the 
art in order to alter NAAP-encoding sequences for a variety of purposes including, but not limited to,
modification of the cloning, processing, and/or expression of the gene product. DNA shuffling by
random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be 
used to engineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed 
mutagenesis may be used to introduce mutations that create new restriction sites, alter glycosylation 
patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No.
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat.
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or improve
the biological properties of NAAP, such as its biological or enzymatic activity or its ability to bind to
other molecules or compounds. DNA shuffling is a process by which a library of gene variants is
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 
preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular
evolution. For example, fragments of a single gene containing random point mutations may be
recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively,
fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

In another embodiment, polynucleotides encoding NAAP may be synthesized, in whole or in
part, using one or more chemical methods well known in the art (Caruthers, M.H. et al. (1980)
Nucleic Acids Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232).
Alternatively, NAAP itself or a fragment thereof may be synthesized using chemical methods known
in the art. For example, peptide synthesis can be performed using various solution-phase or
solid-phase techniques (Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH
Freeman, New York NY, pp. 55-60; Roberge, J.Y. et al. (1995) Science 269:202-204). Automated
synthesis may be achieved using the ABI 431A peptide synthesizer (Applied Biosystems).
Additionally, the amino acid sequence of NAAP, or any part thereof, may be altered during direct
synthesis and/or combined with sequences from other proteins, or any part thereof, to produce a
variant polypeptide or a polypeptide having a sequence of a naturally occurring polypeptide.

The peptide may be substantially purified by preparative high performance liquid
chromatography (Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421). The
composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing
(Creighton, supra, pp. 28-53).

In order to express a biologically active NAAP, the polynucleotides encoding NAAP or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains
the necessary elements for transcriptional and translational control of the inserted coding sequence in
a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and
inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotides encoding
NAAP. Such elements may vary in their strength and specificity. Specific initiation signals may also 
be used to achieve more efficient translation of polynucleotides encoding NAAP. Such signals include
the ATG initiation codon and adjacent sequences, e.g., the Kozak sequence. In cases where a
polynucleotide sequence encoding NAAP and its initiation codon and upstream regulatory sequences
are inserted into the appropriate expression vector, no additional transcriptional or translational control 
signals may be needed. However, in cases where only coding sequence, or a fragment thereof, is 
inserted, exogenous translational control signals including an in-frame ATG initiation codon should be
provided by the vector. Exogenous translational elements and initiation codons may be of various
origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of 
enhancers appropriate for the particular host cell system used (Scharf, D. et al. (1994) Results Probl. 
Cell Differ. 20:125-162). 
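
The requirement for an in-frame ATG initiation codon and a favorable adjacent sequence can be illustrated with a minimal sketch. The function names are hypothetical, and the scoring rule (a purine at position -3 and a G at position +4, relative to the A of the ATG) is an assumption based on the commonly cited vertebrate Kozak context; it is not a method described in this specification.

```python
# Illustrative sketch only. Function names and the two-position scoring rule
# are assumptions, not part of the specification.

def has_in_frame_start(insert: str, frame_offset: int = 0) -> bool:
    """Return True if an ATG codon occurs in the reading frame that begins
    at frame_offset (0, 1, or 2) of the inserted sequence."""
    seq = insert.upper()
    return any(seq[i:i + 3] == "ATG" for i in range(frame_offset, len(seq) - 2, 3))


def kozak_strength(seq: str, atg_index: int) -> str:
    """Classify the context of the ATG starting at atg_index.

    Counts a purine (A or G) at position -3 and a G at position +4:
    both present -> 'strong', one -> 'adequate', none -> 'weak'.
    """
    seq = seq.upper()
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    minus3 = seq[atg_index - 3] if atg_index >= 3 else ""
    plus4 = seq[atg_index + 3] if atg_index + 3 < len(seq) else ""
    hits = (minus3 in ("A", "G")) + (plus4 == "G")
    return {2: "strong", 1: "adequate", 0: "weak"}[hits]
```

A check of this kind would flag a coding fragment that lacks its own in-frame ATG, in which case the vector would need to supply the initiation codon as described above.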

Methods which are well known to those skilled in the art may be used to construct expression
vectors containing polynucleotides encoding NAAP and appropriate transcriptional and translational
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques,
and in vivo genetic recombination (Sambrook and Russell, supra, ch. 1-4, and 8; Ausubel et al.,
supra, ch. 1, 3, and 15).

A variety of expression vector/host systems may be utilized to contain and express 
polynucleotides encoding NAAP. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors;
yeast transformed with yeast expression vectors; insect cell systems infected with viral expression
vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g.,
cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors
(e.g., Ti or pBR322 plasmids); or animal cell systems (Sambrook and Russell, supra; Ausubel et al.,
supra; Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al.
(1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945;
Takamatsu, N. (1987) EMBO J. 6:307-311; The McGraw Hill Yearbook of Science and
Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc.
Natl. Acad. Sci. USA 81:3655-3659; Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355).
Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from
various bacterial plasmids, may be used for delivery of polynucleotides to the targeted organ, tissue, or
cell population (Di Nicola, M. et al. (1998) Cancer Gen. Ther. 5:350-356; Yu, M. et al. (1993) Proc.
Natl. Acad. Sci. USA 90:6340-6344; Buller, R.M. et al. (1985) Nature 317:813-815; McGregor, D.P.
et al. (1994) Mol. Immunol. 31:219-226; Verma, I.M. and N. Somia (1997) Nature 389:239-242). The 
invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending
upon the use intended for polynucleotides encoding NAAP. For example, routine cloning, subcloning,
and propagation of polynucleotides encoding NAAP can be achieved using a multifunctional E. coli
vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid (Invitrogen).
Ligation of polynucleotides encoding NAAP into the vector's multiple cloning site disrupts the lacZ
gene, allowing a colorimetric screening procedure for identification of transformed bacteria containing
recombinant molecules. In addition, these vectors may be useful for in vitro transcription, dideoxy
sequencing, single strand rescue with helper phage, and creation of nested deletions in the cloned
sequence (Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509). When large
quantities of NAAP are needed, e.g., for the production of antibodies, vectors which direct high level
expression of NAAP may be used. For example, vectors containing the strong, inducible SP6 or T7
bacteriophage promoter may be used.

Yeast expression systems may be used for production of NAAP. A number of vectors
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration
of foreign polynucleotide sequences into the host genome for stable propagation (Ausubel et al.,
supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; Scorer, C.A. et al. (1994)
Bio/Technology 12:181-184).

Plant systems may also be used for expression of NAAP. Transcription of polynucleotides
encoding NAAP may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J.
6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock
promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984)
Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105). These constructs
can be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection
(The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp.
191-196).

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, polynucleotides encoding NAAP may be ligated 
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain
infective virus which expresses NAAP in host cells (Logan, J. and T. Shenk (1984) Proc. Natl. Acad.
Sci. USA 81:3655-3659). In addition, transcription enhancers, such as the Rous sarcoma virus (RSV)
enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV-based vectors 
may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers,
or vesicles) for therapeutic purposes (Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355).

For long term production of recombinant proteins in mammalian systems, stable expression of 
NAAP in cell lines is preferred. For example, polynucleotides encoding NAAP can be transformed
into cell lines using expression vectors which may contain viral origins of replication and/or
endogenous expression elements and a selectable marker gene on the same or on a separate vector.
Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched
media before being switched to selective media. The purpose of the selectable marker is to confer
resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells may be 
propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine
phosphoribosyltransferase genes, for use in tk- and apr- cells, respectively (Wigler, M. et al. (1977)
Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823). Also, antimetabolite, antibiotic, or herbicide
resistance can be used as the basis for selection. For example, dhfr confers resistance to
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat
(which encodes phosphinothricin acetyltransferase) confer resistance to chlorsulfuron and
phosphinothricin, respectively (Wigler, M. et al.
(1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol.
150:1-14). Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular
requirements for metabolites (Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA
85:8047-8051). Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech),
β-glucuronidase and its substrate β-glucuronide, or luciferase and its substrate luciferin, may be used.
These markers can be used not only to identify transformants, but also to quantify the amount of
transient or stable protein expression attributable to a specific vector system (Rhodes, C.A. (1995)
Methods Mol. Biol. 55:121-131).

Although the presence/absence of marker gene expression suggests that the gene of interest
is also present, the presence and expression of the gene may need to be confirmed. For example, if
the sequence encoding NAAP is inserted within a marker gene sequence, transformed cells containing 
polynucleotides encoding NAAP can be identified by the absence of marker gene function. 
Alternatively, a marker gene can be placed in tandem with a sequence encoding NAAP under the
control of a single promoter. Expression of the marker gene in response to induction or selection
usually indicates expression of the tandem gene as well. 

In general, host cells that contain the polynucleotide encoding NAAP and that express NAAP 
may be identified by a variety of procedures known to those of skill in the art. These procedures
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and
protein bioassay or immunoassay techniques which include membrane, solution, or chip based
technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of NAAP using either
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing
monoclonal antibodies reactive to two non-interfering epitopes on NAAP is preferred, but a
competitive binding assay may be employed. These and other assays are well known in the art
(Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN, Sect.
IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology, Greene Pub. Associates and
Wiley-Interscience, New York NY; Pound, J.D. (1998) Immunochemical Protocols, Humana Press, Totowa
NJ).

A wide variety of labels and conjugation techniques are known by those skilled in the art and
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization
or PCR probes for detecting sequences related to polynucleotides encoding NAAP include
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide.
Alternatively, polynucleotides encoding NAAP, or any fragments thereof, may be cloned into a vector
for the production of an mRNA probe. Such vectors are known in the art, are commercially available,
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety
of commercially available kits, such as those provided by Amersham Biosciences, Promega (Madison
WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease of
detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as
well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with polynucleotides encoding NAAP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing
polynucleotides which encode NAAP may be designed to contain signal sequences which direct 
secretion of NAAP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted polynucleotides or to process the expressed protein in the desired fashion. Such modifications
of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation,
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity.
Different host cells which have specific cellular machinery and characteristic mechanisms for
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant polynucleotides
encoding NAAP may be ligated to a heterologous sequence resulting in translation of a fusion protein
in any of the aforementioned host systems. For example, a chimeric NAAP protein containing a
heterologous moiety that can be recognized by a commercially available antibody may facilitate the
screening of peptide libraries for inhibitors of NAAP activity. Heterologous protein and peptide
moieties may also facilitate purification of fusion proteins using commercially available affinity
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins,
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site
located between the NAAP encoding sequence and the heterologous protein sequence, so that NAAP
may be cleaved away from the heterologous moiety following purification. Methods for fusion protein
expression and purification are discussed in Ausubel et al. (supra, ch. 10 and 16). A variety of
commercially available kits may also be used to facilitate expression and purification of fusion proteins.
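
The pairing of each heterologous moiety with its purification route, as listed in the preceding paragraph, can be captured in a small lookup table. The following is a minimal sketch; the dictionary, set, and function names are hypothetical, and only the tag-to-matrix pairings are taken from the text.

```python
# Illustrative sketch only. The pairings mirror the paragraph above; all
# identifiers are hypothetical names chosen for this example.

AFFINITY_MATRIX = {
    "GST": "immobilized glutathione",
    "MBP": "immobilized maltose",
    "Trx": "immobilized phenylarsine oxide",
    "CBP": "immobilized calmodulin",
    "6-His": "metal-chelate resin",
}

# Epitope tags purified with tag-specific antibodies rather than a ligand resin.
EPITOPE_TAGS = {"FLAG", "c-myc", "HA"}


def purification_route(tag: str) -> str:
    """Return the purification route the text associates with a fusion tag."""
    if tag in AFFINITY_MATRIX:
        return f"affinity purification on {AFFINITY_MATRIX[tag]}"
    if tag in EPITOPE_TAGS:
        return f"immunoaffinity purification with an anti-{tag} antibody"
    raise KeyError(f"tag not listed in the text: {tag}")
```

For example, `purification_route("GST")` returns the glutathione route, while `purification_route("HA")` returns the immunoaffinity route.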

In another embodiment, synthesis of radiolabeled NAAP may be achieved in vitro using the 
TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems couple 
transcription and translation of protein-coding sequences operably associated with the T7, T3, or SP6
promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for
example, 35S-methionine.

NAAP, fragments of NAAP, or variants of NAAP may be used to screen for compounds 
that specifically bind to NAAP. One or more test compounds may be screened for specific binding to 
NAAP. In various embodiments, 1, 2, 3, 4, 5, 10, 20, 50, 100, or 200 test compounds can be screened
for specific binding to NAAP. Examples of test compounds can include antibodies, anticalins,
oligonucleotides, proteins (e.g., ligands or receptors), or small molecules. 

In related embodiments, variants of NAAP can be used to screen for binding of test
compounds, such as antibodies, to NAAP, a variant of NAAP, or a combination of NAAP and/or one
or more variants of NAAP. In an embodiment, a variant of NAAP can be used to screen for
compounds that bind to a variant of NAAP, but not to NAAP having the exact sequence of a
sequence of SEQ ID NO:1-57. NAAP variants used to perform such screening can have a range of
about 50% to about 99% sequence identity to NAAP, with various embodiments having 60%, 70%,
75%, 80%, 85%, 90%, and 95% sequence identity.
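
As an illustration of how a percent sequence identity figure of this kind might be computed, the following minimal sketch assumes the two sequences have already been aligned and are supplied as equal-length strings with "-" marking gaps. The function names are hypothetical, and the choice to count identity over ungapped aligned positions only is an assumption; alignment programs differ in the denominator they use.

```python
# Illustrative sketch only. Identity is computed over aligned, ungapped
# positions; this denominator is an assumption, not a definition from the text.

def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(a == b for a, b in pairs)
    return 100.0 * matches / len(pairs)


def in_screening_range(identity: float, low: float = 50.0, high: float = 99.0) -> bool:
    """True if the identity lies in the approximately 50% to 99% range stated above."""
    return low <= identity <= high
```

Under these assumptions, a variant differing from the reference at one of four aligned positions scores 75% identity and falls within the stated range, while an exact copy scores 100% and does not.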

In an embodiment, a compound identified in a screen for specific binding to NAAP can be 
closely related to the natural ligand of NAAP, e.g., a ligand or fragment thereof, a natural substrate, a 
structural or functional mimetic, or a natural binding partner (Coligan, J.E. et al. (1991) Current
Protocols in Immunology 1(2):Chapter 5). In another embodiment, the compound thus identified can
be a natural ligand of a receptor NAAP (Howard, A.D. et al. (2001) Trends Pharmacol. Sci. 22:132-140;
Wise, A. et al. (2002) Drug Discovery Today 7:235-246).

In other embodiments, a compound identified in a screen for specific binding to NAAP can be 
closely related to the natural receptor to which NAAP binds, at least a fragment of the receptor, or a
fragment of the receptor including all or a portion of the ligand binding site or binding pocket. For
example, the compound may be a receptor for NAAP which is capable of propagating a signal, or a
decoy receptor for NAAP which is not capable of propagating a signal (Ashkenazi, A. and V.M. Dixit
(1999) Curr. Opin. Cell Biol. 11:255-260; Mantovani, A. et al. (2001) Trends Immunol. 22:328-336).
The compound can be rationally designed using known techniques. Examples of such techniques
include those used to construct the compound etanercept (ENBREL; Amgen Inc., Thousand Oaks
CA), which is efficacious for treating rheumatoid arthritis in humans. Etanercept is an engineered p75
tumor necrosis factor (TNF) receptor dimer linked to the Fc portion of human IgG1 (Taylor, P.C. et al.
(2001) Curr. Opin. Immunol. 13:611-616).

In one embodiment, two or more antibodies having similar or, alternatively, different
specificities can be screened for specific binding to NAAP, fragments of NAAP, or variants of
NAAP. The binding specificity of the antibodies thus screened can thereby be selected to identify
particular fragments or variants of NAAP. In one embodiment, an antibody can be selected such that
its binding specificity allows for preferential identification of specific fragments or variants of NAAP.
In another embodiment, an antibody can be selected such that its binding specificity allows for
preferential diagnosis of a specific disease or condition having increased, decreased, or otherwise
abnormal production of NAAP.

In an embodiment, anticalins can be screened for specific binding to NAAP, fragments of
NAAP, or variants of NAAP. Anticalins are ligand-binding proteins that have been constructed based
on a lipocalin scaffold (Weiss, G.A. and H.B. Lowman (2000) Chem. Biol. 7:R177-R184; Skerra, A.
(2001) J. Biotechnol. 74:257-275). The protein architecture of lipocalins can include a beta-barrel
having eight antiparallel beta-strands, which supports four loops at its open end. These loops form the
natural ligand-binding site of the lipocalins, a site which can be re-engineered in vitro by amino acid
substitutions to impart novel binding specificities. The amino acid substitutions can be made using 
methods known in the art or described herein, and can include conservative substitutions (e.g., 
substitutions that do not alter binding specificity) or substitutions that modestly, moderately, or 
significantly alter binding specificity. 

In one embodiment, screening for compounds which specifically bind to, stimulate, or inhibit
NAAP involves producing appropriate cells which express NAAP, either as a secreted protein or on
the cell membrane. Preferred cells can include cells from mammals, yeast, Drosophila, or E. coli.
Cells expressing NAAP or cell membrane fractions which contain NAAP are then contacted with a
test compound and binding, stimulation, or inhibition of activity of either NAAP or the compound is
analyzed.

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, the 
assay may comprise the steps of combining at least one test compound with NAAP, either in solution 
or affixed to a solid support, and detecting the binding of NAAP to the compound. Alternatively, the
assay may detect or measure binding of a test compound in the presence of a labeled competitor.
Additionally, the assay may be carried out using cell-free preparations, chemical libraries, or natural 
product mixtures, and the test compound(s) may be free in solution or affixed to a solid support. 

An assay can be used to assess the ability of a compound to bind to its natural ligand and/or to 
inhibit the binding of its natural ligand to its natural receptors. Examples of such assays include
radiolabeling assays such as those described in U.S. Patent No. 5,914,236 and U.S. Patent No. 6,372,724.
In a related embodiment, one or more amino acid substitutions can be introduced into a polypeptide
compound (such as a receptor) to improve or alter its ability to bind to its natural ligands (Matthews,
D.J. and J.A. Wells (1994) Chem. Biol. 1:25-30). In another related embodiment, one or more amino
acid substitutions can be introduced into a polypeptide compound (such as a ligand) to improve or alter
its ability to bind to its natural receptors (Cunningham, B.C. and J.A. Wells (1991) Proc. Natl. Acad.
Sci. USA 88:3407-3411; Lowman, H.B. et al. (1991) J. Biol. Chem. 266:10982-10988).

NAAP, fragments of NAAP, or variants of NAAP may be used to screen for compounds
that modulate the activity of NAAP. Such compounds may include agonists, antagonists, or partial or
inverse agonists. In one embodiment, an assay is performed under conditions permissive for NAAP
activity, wherein NAAP is combined with at least one test compound, and the activity of NAAP in the
presence of a test compound is compared with the activity of NAAP in the absence of the test
compound. A change in the activity of NAAP in the presence of the test compound is indicative of a
compound that modulates the activity of NAAP. Alternatively, a test compound is combined with an
in vitro or cell-free system comprising NAAP under conditions suitable for NAAP activity, and the
assay is performed. In either of these assays, a test compound which modulates the activity of
NAAP may do so indirectly and need not come in direct contact with NAAP. At least one
and up to a plurality of test compounds may be screened.
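
The comparison of NAAP activity in the presence and absence of a test compound can be sketched as a simple relative-change calculation. The function name, the units-free activity values, and the 20% decision threshold are all assumptions made for illustration; the specification does not define a particular threshold or readout.

```python
# Illustrative sketch only. The threshold and the labels returned are
# hypothetical; a real assay would use replicate measurements and statistics.

def classify_modulation(activity_with: float,
                        activity_without: float,
                        threshold: float = 0.2) -> str:
    """Compare activity measured with and without a test compound.

    Returns a label based on the fractional change relative to the
    no-compound control.
    """
    if activity_without <= 0:
        raise ValueError("control activity must be positive")
    change = (activity_with - activity_without) / activity_without
    if change >= threshold:
        return "increases activity"
    if change <= -threshold:
        return "decreases activity"
    return "no clear change"
```

Applied to each compound in a screen, a rule of this kind separates candidates that raise NAAP activity from those that lower it, leaving small differences unclassified.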

In another embodiment, polynucleotides encoding NAAP or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem (ES) 
cells. Such techniques are well known in the art and are useful for the generation of animal models of 
human disease (see, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337). For example,
mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo and
grown in culture. The ES cells are transformed with a vector containing the gene of interest disrupted
by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) Science
244:1288-1292). The vector integrates into the corresponding region of the host genome by
homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP
system to knockout a gene of interest in a tissue- or developmental stage-specific manner (Marth, J.D.
(1996) J. Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330).
Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and
the resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous
strains. Transgenic animals thus generated may be tested with potential therapeutic or toxic agents.

Polynucleotides encoding NAAP may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al.
(1998) Science 282:1145-1147).

Polynucleotides encoding NAAP can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region
of a polynucleotide encoding NAAP is injected into animal ES cells, and the injected sequence
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with
potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively,
a mammal inbred to overexpress NAAP, e.g., by secreting NAAP in its milk, may also serve as a
convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74).

THERAPEUTICS

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between 
regions of NAAP and nucleic acid-associated proteins. In addition, examples of tissues expressing
NAAP can be found in Table 6 and can also be found in Example XI. Therefore, NAAP appears to
play a role in cell proliferative, neurological, developmental, and autoimmune/inflammatory disorders, 
and infections. In the treatment of disorders associated with increased NAAP expression or activity, 
it is desirable to decrease the expression or activity of NAAP. In the treatment of disorders 
associated with decreased NAAP expression or activity, it is desirable to increase the expression or
activity of NAAP.

Therefore, in one embodiment, NAAP or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of NAAP. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia
vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, leukemia, 
lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, a cancer of the adrenal 
gland, bladder, bone, bone marrow, brain, breast, cervix, colon, gall bladder, ganglia, gastrointestinal 
tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands,
skin, spleen, testis, thymus, thyroid, and uterus; a neurological disorder such as epilepsy, ischemic
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic 
lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis 
pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral
meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases including
kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal familial
insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous
sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation 
and other developmental disorder of the central nervous system, cerebral palsy, a neuroskeletal 
disorder, an autonomic nervous system disorder, a cranial nerve disorder, a spinal cord disease,
muscular dystrophy and other neuromuscular disorder, a peripheral nervous system disorder,
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathy, myasthenia
gravis, periodic paralysis, a mental disorder including mood, anxiety, and schizophrenic disorder,
seasonal affective disorder (SAD), akathisia, amnesia, catatonia, diabetic neuropathy, tardive
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, and Tourette's disorder; a
developmental disorder such as renal tubular acidosis, anemia, Cushing's syndrome, achondroplastic
dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome
(Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation), Smith-Magenis syndrome,
myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary keratodermas, hereditary
neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism,
hydrocephalus, seizure disorders such as Sydenham's chorea and cerebral palsy, spina bifida,
anencephaly, craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing loss; an 
autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AIDS), Addison's 
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 
asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic 
lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner 
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
fungal, parasitic, protozoal, and helminthic infections, and trauma; and an infection, such as those 

30 caused by a viral agent classified as adenovirus, arenavirus, bunyavirus, calicivirus, coronavirus, 
filovirus, hepadnavirus, herpesvirus, flavivirus, orthomyxovirus, parvovirus, papovavirus, 
paramyxovirus, picornavirus, poxvirus, reovirus, retrovirus, rhabdovirus, or togavirus; an infection
caused by a bacterial agent classified as pneumococcus, staphylococcus, streptococcus, bacillus,
corynebacterium, Clostridium, meningococcus, gonococcus, listeria, moraxella, kingella, haemophilus,
legionella, bordetella, gram-negative enterobacterium including shigella, salmonella, or Campylobacter,
pseudomonas, vibrio, brucella, francisella, yersinia, bartonella, norcardium, actinomyces,
mycobacterium, spirochaetale, rickettsia, chlamydia, or mycoplasma; an infection caused by a fungal
agent classified as aspergillus, blastomyces, dermatophytes, cryptococcus, coccidioides, malasezzia, 
histoplasma, or other mycosis-causing fungal agent; and an infection caused by a parasite classified as 
Plasmodium or malaria-causing, parasitic entamoeba, leishmania, trypanosoma, toxoplasma, 
Pneumocystis carinii, intestinal protozoa such as giardia, trichomonas, tissue nematode such as
trichinella, intestinal nematode such as ascaris, lymphatic filarial nematode, trematode such as
schistosoma, and cestode such as tapeworm. 

In another embodiment, a vector capable of expressing NAAP or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of NAAP including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified NAAP in
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent
a disorder associated with decreased expression or activity of NAAP including, but not limited to, 
those provided above. 

In still another embodiment, an agonist which modulates the activity of NAAP may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of NAAP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of NAAP may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of NAAP. Examples of such 
disorders include, but are not limited to, those cell proliferative, neurological, developmental, and
autoimmune/inflammatory disorders, and infections described above. In one aspect, an antibody which
specifically binds NAAP may be used directly as an antagonist or indirectly as a targeting or delivery 
mechanism for bringing a pharmaceutical agent to cells or tissues which express NAAP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding NAAP may be administered to a subject to treat or prevent a disorder associated with
increased expression or activity of NAAP including, but not limited to, those described above.

In other embodiments, any protein, agonist, antagonist, antibody, complementary sequence, or 
vector embodiments may be administered in combination with other appropriate therapeutic agents.
Selection of the appropriate agents for use in combination therapy may be made by one of ordinary
skill in the art, according to conventional pharmaceutical principles. The combination of therapeutic 
agents may act synergistically to effect the treatment or prevention of the various disorders described 
above. Using this approach, one may be able to achieve therapeutic efficacy with lower dosages of
each agent, thus reducing the potential for adverse side effects.

An antagonist of NAAP may be produced using methods which are generally known in the
art. In particular, purified NAAP may be used to produce antibodies or to screen libraries of
pharmaceutical agents to identify those which specifically bind NAAP. Antibodies to NAAP may
also be generated using methods that are well known in the art. Such antibodies may include, but are
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and
fragments produced by a Fab expression library. In an embodiment, neutralizing antibodies (i.e., those
which inhibit dimer formation) can be used therapeutically. Single chain antibodies (e.g., from camels 
or llamas) may be potent enzyme inhibitors and may have application in the design of peptide mimetics, 
and in the development of immuno-adsorbents and biosensors (Muyldermans, S. (2001) J. Biotechnol.
74:277-302).

For the production of antibodies, various hosts including goats, rabbits, rats, mice, camels, 
dromedaries, llamas, humans, and others may be immunized by injection with NAAP or with any
fragment or oligopeptide thereof which has immunogenic properties. Depending on the host species,
various adjuvants may be used to increase immunological response. Such adjuvants include, but are
not limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among
adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are 
especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 
NAAP have an amino acid sequence consisting of at least about 5 amino acids, and generally will
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are substantially identical to a portion of the amino acid sequence of the natural protein. 
Short stretches of NAAP amino acids may be fused with those of another protein, such as KLH, and 
antibodies to the chimeric molecule may be produced. 

Monoclonal antibodies to NAAP may be prepared using any technique which provides for the
production of antibody molecules by continuous cell lines in culture. These include, but are not limited
to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma
technique (Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods
81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; Cole, S.P. et al. (1984)
Mol. Cell Biol. 62:109-120).

In addition, techniques developed for the production of "chimeric antibodies," such as the
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate
antigen specificity and biological activity, can be used (Morrison, S.L. et al. (1984) Proc. Natl. Acad.
Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; Takeda, S. et al. (1985)
Nature 314:452-454). Alternatively, techniques described for the production of single chain antibodies
may be adapted, using methods known in the art, to produce NAAP-specific single chain antibodies.
Antibodies with related specificity, but of distinct idiotypic composition, may be generated by chain
shuffling from random combinatorial immunoglobulin libraries (Burton, D.R. (1991) Proc. Natl. Acad.
Sci. USA 88:10134-10137).

Antibodies may also be produced by inducing in vivo production in the lymphocyte population
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in
the literature (Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, G. et al.
(1991) Nature 349:293-299).

Antibody fragments which contain specific binding sites for NAAP may also be generated.
For example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and
easy identification of monoclonal Fab fragments with the desired specificity (Huse, W.D. et al. (1989)
Science 246:1275-1281).

Various immunoassays may be used for screening to identify antibodies having the desired
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such
immunoassays typically involve the measurement of complex formation between NAAP and its
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive
to two non-interfering NAAP epitopes is generally used, but a competitive binding assay may also be
employed (Pound, supra).

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques
may be used to assess the affinity of antibodies for NAAP. Affinity is expressed as an association
constant, Ka, which is defined as the molar concentration of NAAP-antibody complex divided by the
molar concentrations of free antigen and free antibody under equilibrium conditions. The Ka
determined for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for
multiple NAAP epitopes, represents the average affinity, or avidity, of the antibodies for NAAP. The
Ka determined for a preparation of monoclonal antibodies, which are monospecific for a particular
NAAP epitope, represents a true measure of affinity. High-affinity antibody preparations with Ka
ranging from about 10^9 to 10^12 L/mole are preferred for use in immunoassays in which the NAAP-
antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations with Ka
ranging from about 10^6 to 10^7 L/mole are preferred for use in immunopurification and similar
procedures which ultimately require dissociation of NAAP, preferably in active form, from the
antibody (Catty, D. (1988) Antibodies, Volume I: A Practical Approach. IRL Press, Washington DC;
Liddell, J.E. and A. Cryer (1991) A Practical Guide to Monoclonal Antibodies. John Wiley & Sons,
New York NY).
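
By way of illustration only, the definition of the association constant given above may be expressed as a short calculation. The following Python sketch is not part of the disclosed methods; the function names and the example concentrations are hypothetical, and the classification simply restates the approximate ranges recited above.

```python
def association_constant(complex_molar, free_antigen_molar, free_antibody_molar):
    """Return Ka in L/mole: [complex] / ([free antigen] * [free antibody])."""
    if free_antigen_molar <= 0 or free_antibody_molar <= 0:
        raise ValueError("free concentrations must be positive")
    return complex_molar / (free_antigen_molar * free_antibody_molar)


def suggested_use(ka):
    """Map a Ka value onto the approximate ranges recited in the text."""
    if 1e9 <= ka <= 1e12:
        return "immunoassay"
    if 1e6 <= ka <= 1e7:
        return "immunopurification"
    return "outside the recited ranges"


# Hypothetical equilibrium concentrations, all in mol/L.
ka = association_constant(2e-9, 1e-9, 1e-9)
print(f"Ka = {ka:.1e} L/mole: {suggested_use(ka)}")
```

For a polyclonal preparation the same calculation would yield an average value (avidity) rather than a true affinity, as noted above.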

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg
specific antibody/ml, is generally employed in procedures requiring precipitation of NAAP-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available (Catty, supra; Coligan et al., 
supra). 

In another embodiment of the invention, polynucleotides encoding NAAP, or any fragment or
complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene
expression can be achieved by designing complementary sequences or antisense molecules (DNA, 
RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 
NAAP. Such technology is well known in the art, and antisense oligonucleotides or larger fragments
can be designed from various locations along the coding or control regions of sequences encoding
NAAP (Agrawal, S., ed. (1996) Antisense Therapeutics, Humana Press, Totowa NJ).
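
As a minimal illustration of the design of a complementary sequence, the following Python sketch derives the antisense RNA (the reverse complement) of a chosen region of a transcript. The function name and the example sequence are hypothetical; selection of an effective target region would require further evaluation not shown here.

```python
# Complement table for RNA bases.
_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(target_region):
    """Return the antisense RNA (reverse complement) of a target region.

    The input is read 5' to 3'; DNA input is accepted and converted to RNA.
    The returned sequence is also written 5' to 3'.
    """
    seq = target_region.upper().replace("T", "U")
    unexpected = set(seq) - set("ACGU")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return seq.translate(_COMPLEMENT)[::-1]


# Hypothetical six-base target region.
print(antisense_rna("AUGGCC"))
```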

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence
complementary to at least a portion of the cellular sequence encoding the target protein (Slater, J.E. et
al. (1998) J. Allergy Clin. Immunol. 102:469-475; Scanlon, K.J. et al. (1995) 9:1288-1296). Antisense
sequences can also be introduced intracellularly through the use of viral vectors, such as retrovirus and
adeno-associated virus vectors (Miller, A.D. (1990) Blood 76:271; Ausubel et al., supra; Uckert, W.
and W. Walther (1994) Pharmacol. Ther. 63:323-347). Other gene delivery mechanisms include
liposome-derived systems, artificial viral envelopes, and other systems known in the art (Rossi, J.J.
(1995) Br. Med. Bull. 51:217-225; Boado, R.J. et al. (1998) J. Pharm. Sci. 87:1308-1315; Morris,
M.C. et al. (1997) Nucleic Acids Res. 25:2730-2736).

In another embodiment of the invention, polynucleotides encoding NAAP may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X-
linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475),
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassemias, familial
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal,
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii)
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA 93:11395-11399), hepatitis
B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi). In
the case where a genetic deficiency in NAAP expression or regulation causes disease, the expression 
of NAAP from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in
NAAP are treated by constructing mammalian expression vectors encoding NAAP and introducing
these vectors by mechanical means into NAAP-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic 
gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and
(v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem.
62:191-217; Ivics, Z. (1997) Cell 91:501-510; Boulay, J.-L. and H. Récipon (1998) Curr. Opin.
Biotechnol. 9:445-450).

Expression vectors that may be effective for the expression of NAAP include, but are not
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors
(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA),
and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). NAAP
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous
sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or β-actin genes), (ii) an inducible promoter
(e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. 
USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau 
(1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen));
the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V.
and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding NAAP from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of these
standardized mammalian transfection protocols.

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to NAAP expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding NAAP under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive
element (RRE) along with additional retrovirus cis-acting RNA sequences and coding sequences
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al.
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. et
al. (1998) J. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaining
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses 
a method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 
Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4+ T-cells), and the
return of transduced cells to a patient are procedures well known to persons skilled in the art of gene
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et
al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. (1998)
Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290).

In an embodiment, an adenovirus-based gene therapy delivery system is used to deliver
polynucleotides encoding NAAP to cells which have one or more genetic abnormalities with respect to
the expression of NAAP. The construction and packaging of adenovirus-based vectors are well 
known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are
described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"),
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999; Annu.
Rev. Nutr. 19:511-544) and Verma, I.M. and N. Somia (1997; Nature 18:389:239-242).

In another embodiment, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding NAAP to target cells which have one or more genetic abnormalities with
respect to the expression of NAAP. The use of herpes simplex virus (HSV)-based vectors may be
especially valuable for introducing NAAP to cells of the central nervous system, for which HSV has a 
tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1-based vector has
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res.
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S.
Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby
incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV d92
which consists of a genome containing at least one exogenous gene to be transferred to a cell under 
the control of the appropriate promoter for purposes including human gene therapy. Also taught by
this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and
ICP22. For HSV vectors, see also Goins, W.F. et al. (1999; J. Virol. 73:519-532) and Xu, H. et al.
(1994; Dev. Biol. 163:152-161). The manipulation of cloned herpesvirus sequences, the generation of
recombinant virus following the transfection of multiple plasmids containing different segments of the
large herpesvirus genomes, the growth and propagation of herpesvirus, and the infection of cells with
herpesvirus are techniques well known to those of ordinary skill in the art.

In another embodiment, an alphavirus (positive, single-stranded RNA virus) vector is used to
deliver polynucleotides encoding NAAP to target cells. The biology of the prototypic alphavirus,
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA,
resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity
(e.g., protease and polymerase). Similarly, inserting the coding sequence for NAAP into the 
alphavirus genome in place of the capsid-coding region results in the production of a large number of 
NAAP-coding RNAs and the synthesis of high levels of NAAP in vector transduced cells. While 
alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a
persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN)
indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy
application (Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will
allow the introduction of NAAP into a variety of cell types. The specific transduction of a subset of 
cells in a population may require the sorting of cells prior to transduction. The methods of
manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA
transfections, and performing alphavirus infections, are well known to those with ordinary skill in the
art.

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases,
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have
been described in the literature (Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, Molecular and
Immunologic Approaches. Futura Publishing, Mt. Kisco NY, pp. 163-177). A complementary
sequence or antisense molecule may also be designed to block translation of mRNA by preventing the
transcript from binding to ribosomes.

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of RNA molecules encoding NAAP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA,
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of
candidate targets may also be evaluated by testing accessibility to hybridization with complementary
oligonucleotides using ribonuclease protection assays. 
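
The initial scanning step described above may be sketched, by way of example only, in Python. The function name, the default window of 17 ribonucleotides, and the example sequence are hypothetical; the sketch only locates GUA, GUU, and GUC triplets and returns a surrounding stretch of 15 to 20 ribonucleotides, and does not evaluate secondary structure or accessibility.

```python
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def find_cleavage_sites(rna, window=17):
    """Return (position, triplet, surrounding sequence) for each candidate site.

    position is the 0-based index of the G of the triplet. The surrounding
    sequence is a stretch of up to `window` ribonucleotides placed roughly
    symmetrically around the triplet (shorter near the ends of the target).
    """
    if not 15 <= window <= 20:
        raise ValueError("window should be between 15 and 20 ribonucleotides")
    rna = rna.upper().replace("T", "U")
    flank = (window - 3) // 2
    sites = []
    for i in range(len(rna) - 2):
        triplet = rna[i:i + 3]
        if triplet in CLEAVAGE_TRIPLETS:
            start = max(0, i - flank)
            end = min(len(rna), start + window)
            sites.append((i, triplet, rna[start:end]))
    return sites


# Hypothetical target sequence.
for position, triplet, region in find_cleavage_sites("AAGUCAAGUAGG"):
    print(position, triplet, region)
```

Each returned region would then be assessed for secondary structural features and for accessibility, for example by ribonuclease protection assays, as described above.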

Complementary ribonucleic acid molecules and ribozymes may be prepared by any method 
known in the art for the synthesis of nucleic acid molecules. These include techniques for chemically
synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. Alternatively,
RNA molecules may be generated by in vitro and in vivo transcription of DNA molecules encoding
NAAP. Such DNA sequences may be incorporated into a wide variety of vectors with suitable RNA 
polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends
of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester linkages
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queuosine,
and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine,
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases.

In other embodiments of the invention, the expression of one or more selected polynucleotides 
of the present invention can be altered, inhibited, decreased, or silenced using RNA interference 
(RNAi) or post-transcriptional gene silencing (PTGS) methods known in the art. RNAi is a post-
transcriptional mode of gene silencing in which double-stranded RNA (dsRNA) introduced into a
targeted cell specifically suppresses the expression of the homologous gene (i.e., the gene bearing the
sequence complementary to the dsRNA). This effectively knocks out or substantially reduces the
expression of the targeted gene. PTGS can also be accomplished by use of DNA or DNA fragments
as well. RNAi methods are described by Fire, A. et al. (1998; Nature 391:806-811) and Gura, T.
(2000; Nature 404:804-808). PTGS can also be initiated by introduction of a complementary segment 
of DNA into the selected tissue using gene delivery and/or viral vector delivery methods described 
herein or known in the art. 

RNAi can be induced in mammalian cells by the use of small interfering RNA also known as
siRNA. SiRNA are shorter segments of dsRNA (typically about 21 to 23 nucleotides in length) that
result in vivo from cleavage of introduced dsRNA by the action of an endogenous ribonuclease.
SiRNA appear to be the mediators of the RNAi effect in mammals. The most effective siRNAs
appear to be 21 nucleotide dsRNAs with 2 nucleotide 3' overhangs. The use of siRNA for inducing
RNAi in mammalian cells is described by Elbashir, S.M. et al. (2001; Nature 411:494-498).

SiRNA can either be generated indirectly by introduction of dsRNA into the targeted cell, or 
directly by mammalian transfection methods and agents described herein or known in the art (such as 
liposome-mediated transfection, viral vector methods, or other polynucleotide delivery/introductory 
methods). Suitable siRNAs can be selected by examining a transcript of the target polynucleotide
(e.g., mRNA) for nucleotide sequences downstream from the AUG start codon and recording the
occurrence of each nucleotide and the 3' adjacent 19 to 23 nucleotides as potential siRNA target sites,
with sequences having a 21 nucleotide length being preferred. Regions to be avoided for target
siRNA sites include the 5' and 3' untranslated regions (UTRs) and regions near the start codon (within
75 bases), as these may be richer in regulatory protein binding sites. UTR-binding proteins and/or
translation initiation complexes may interfere with binding of the siRNP endonuclease complex. The
selected target sites for siRNA can then be compared to the appropriate genome database (e.g.,
human, etc.) using BLAST or other sequence comparison algorithms known in the art. Target
sequences with significant homology to other coding sequences can be eliminated from consideration.
The selected siRNAs can be produced by chemical synthesis methods known in the art or by in vitro
transcription using commercially available methods and kits such as the SILENCER siRNA
construction kit (Ambion, Austin TX). 
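
The enumeration of potential siRNA target sites described above may be illustrated, by way of example only, with the following Python sketch. The function name, its parameters, and the example transcript are hypothetical. The sketch assumes that the positions of the start codon and of the end of the coding region are already known, interprets "within 75 bases" as the 75 bases beginning at the A of the AUG, and treats each candidate as a contiguous stretch of 19 to 23 nucleotides. The subsequent comparison against a genome database (e.g., using BLAST) is not shown.

```python
def sirna_candidates(mrna, cds_start, cds_end, length=21, skip=75):
    """List (position, target sequence) pairs lying within the coding region.

    mrna      -- transcript sequence, written 5' to 3'
    cds_start -- 0-based index of the A of the AUG start codon
    cds_end   -- index one past the last base of the coding region
    length    -- candidate length in nucleotides (19 to 23; 21 preferred)
    skip      -- number of bases downstream of cds_start to avoid
    """
    if not 19 <= length <= 23:
        raise ValueError("length should be between 19 and 23 nucleotides")
    mrna = mrna.upper().replace("T", "U")
    if mrna[cds_start:cds_start + 3] != "AUG":
        raise ValueError("cds_start must point at an AUG start codon")
    first = cds_start + skip   # excludes the 5' UTR and the start-codon region
    last = cds_end - length    # excludes any site running into the 3' UTR
    return [(i, mrna[i:i + length]) for i in range(first, last + 1)]


# Hypothetical transcript: 2-base 5' UTR, 126-base coding region, 2-base 3' UTR.
transcript = "GG" + "AUG" + "ACGU" * 30 + "UAA" + "CC"
candidates = sirna_candidates(transcript, cds_start=2, cds_end=128)
print(len(candidates), candidates[0])
```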

In alternative embodiments, long-term gene silencing and/or RNAi effects can be induced in 
selected tissue using expression vectors that continuously express siRNA. This can be accomplished 
using expression vectors that are engineered to express hairpin RNAs (shRNAs) using methods
known in the art (see, e.g., Brummelkamp, T.R. et al. (2002) Science 296:550-553; and Paddison, P.J.
et al. (2002) Genes Dev. 16:948-958). In these and related embodiments, shRNAs can be delivered to
target cells using expression vectors known in the art. An example of a suitable expression vector for
delivery of siRNA is the PSILENCER1.0-U6 (circular) plasmid (Ambion). Once delivered to the
target tissue, shRNAs are processed in vivo into siRNA-like molecules capable of carrying out gene- 
specific silencing. 

In various embodiments, the expression levels of genes targeted by RNAi or PTGS methods
can be determined by assays for mRNA and/or protein analysis. Expression levels of the mRNA of a
targeted gene can be determined by northern analysis methods using, for example, the
NORTHERNMAX-GLY kit (Ambion); by microarray methods; by PCR methods; by real time PCR
methods; and by other RNA/polynucleotide assays known in the art or described herein. Expression
levels of the protein encoded by the targeted gene can be determined by Western analysis using
standard techniques known in the art.

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding NAAP. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides,
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular
chemical entities which are capable of interacting with specific polynucleotide sequences. Effective
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased NAAP 
expression or activity, a compound which specifically inhibits expression of the polynucleotide
encoding NAAP may be therapeutically useful, and in the treatment of disorders associated with
decreased NAAP expression or activity, a compound which specifically promotes expression of the 
polynucleotide encoding NAAP may be therapeutically useful. 

In various embodiments, one or more test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method
commonly known in the art, including chemical modification of a compound known to be effective in
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a
polynucleotide encoding NAAP is exposed to at least one test compound thus obtained. The sample
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding NAAP are assayed by 
any method commonly known in the art. Typically, the expression of a specific polynucleotide is detected
by hybridization with a probe having a nucleotide sequence complementary to the sequence of the 
polynucleotide encoding NAAP. The amount of hybridization may be quantified, thus forming the 
basis for a comparison of the expression of the polynucleotide both with and without exposure to one
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a
test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example, using a Schizosaccharomyces pombe gene expression system 
(Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res.
28:E15) or a human cell line such as HeLa cells (Clarke, M.L. et al. (2000) Biochem. Biophys. Res.
Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic 
acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide sequence 
(Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No.
6,022,691).
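
The comparison of hybridization signal with and without a test compound can be sketched as follows. This is a minimal illustration only: the function names, the replicate-averaging step, and the two-fold threshold are assumptions introduced here and are not specified in the text.

```python
from statistics import mean


def expression_fold_change(treated, untreated):
    """Ratio of mean hybridization signal with vs. without the test compound."""
    baseline = mean(untreated)
    if baseline <= 0:
        raise ValueError("untreated signal must be positive")
    return mean(treated) / baseline


def classify_compound(treated, untreated, threshold=2.0):
    """Label a test compound by the direction of the expression change.

    `threshold` is a hypothetical fold-change cutoff, not a value from the text.
    """
    fold = expression_fold_change(treated, untreated)
    if fold >= threshold:
        return "promoter"
    if fold <= 1.0 / threshold:
        return "inhibitor"
    return "no effect"


if __name__ == "__main__":
    # Replicate signal readings (arbitrary units) for one test compound.
    print(classify_compound([10, 12, 11], [40, 44, 42]))
```

In practice the cutoff would be chosen from the variability of replicate measurements rather than fixed in advance.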

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient.
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved
using methods which are well known in the art (Goldman, C.K. et al. (1997) Nat. Biotechnol.
15:462-466).

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition which
generally comprises an active ingredient formulated with a pharmaceutically acceptable excipient.
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's
Pharmaceutical Sciences (Mack Publishing, Easton PA). Such compositions may consist of NAAP,
antibodies to NAAP, and mimetics, agonists, antagonists, or inhibitors of NAAP.

In various embodiments, the compositions described herein, such as pharmaceutical 
compositions, may be administered by any number of routes including, but not limited to, oral, 
intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, pulmonary,
transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of
fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides and
proteins), recent developments in the field of pulmonary delivery via the alveolar region of the lung
have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S.
et al., U.S. Patent No. 5,997,848). Pulmonary delivery allows administration without needle injection,
and obviates the need for potentially toxic penetration enhancers.

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination 
of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of
macromolecules comprising NAAP or fragments thereof. For example, liposome preparations
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the
macromolecule. Alternatively, NAAP or a fragment thereof may be joined to a short cationic
N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to
transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et
al. (1999) Science 285:1569-1572).

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
or pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes for
administration in humans.

A therapeutically effective dose refers to that amount of active ingredient, for example 
NAAP or fragments thereof, antibodies to NAAP, and agonists, antagonists or inhibitors of NAAP,
which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined
by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by
calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the dose
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the
therapeutic index, which can be expressed as the LD50/ED50 ratio. Compositions which exhibit large
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are
used to formulate a range of dosage for human use. The dosage contained in such compositions is
preferably within a range of circulating concentrations that includes the ED50 with little or no toxicity.
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the
patient, and the route of administration.
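
As a worked illustration of the therapeutic index, the following sketch estimates the dose giving a 50% response by interpolation on a log-dose scale and then forms the LD50/ED50 ratio. The interpolation method, the function names, and the dose-response values are assumptions chosen for illustration; they are not data or procedures from this disclosure.

```python
import math


def dose_at_half_response(doses, fractions):
    """Interpolate, on a log10 dose scale, the dose at which 50% of subjects respond."""
    points = sorted(zip(doses, fractions))
    for (d_lo, f_lo), (d_hi, f_hi) in zip(points, points[1:]):
        if f_lo <= 0.5 <= f_hi and f_hi > f_lo:
            t = (0.5 - f_lo) / (f_hi - f_lo)
            log_dose = math.log10(d_lo) + t * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_dose
    raise ValueError("response data do not bracket 50%")


def therapeutic_index(ld50, ed50):
    """Ratio of the median lethal dose to the median effective dose."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("doses must be positive")
    return ld50 / ed50


if __name__ == "__main__":
    # Hypothetical dose-response data (dose in arbitrary units, fraction responding).
    ed50 = dose_at_half_response([1, 10, 100], [0.0, 0.5, 1.0])
    ld50 = dose_at_half_response([10, 100, 1000], [0.0, 0.5, 1.0])
    print(ed50, ld50, therapeutic_index(ld50, ed50))
```

A larger ratio corresponds to a wider margin between the effective and the lethal dose.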

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and response
to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 μg to 100,000 μg, up to a total dose of
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and
methods of delivery is provided in the literature and generally available to practitioners in the art.
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc.

DIAGNOSTICS

In another embodiment, antibodies which specifically bind NAAP may be used for the
diagnosis of disorders characterized by expression of NAAP, or in assays to monitor patients being
treated with NAAP or agonists, antagonists, or inhibitors of NAAP. Antibodies useful for diagnostic
purposes may be prepared in the same manner as described above for therapeutics. Diagnostic
assays for NAAP include methods which utilize the antibody and a label to detect NAAP in human
body fluids or in extracts of cells or tissues. The antibodies may be used with or without modification,
and may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring NAAP, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of NAAP expression. Normal
or standard values for NAAP expression are established by combining body fluids or cell extracts
taken from normal mammalian subjects, for example, human subjects, with antibodies to NAAP under
conditions suitable for complex formation. The amount of standard complex formation may be
quantitated by various methods, such as photometric means. Quantities of NAAP expressed in
subject, control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease.

In another embodiment of the invention, polynucleotides encoding NAAP may be used for
diagnostic purposes. The polynucleotides which may be used include oligonucleotides, complementary
RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and quantify gene 
expression in biopsied tissues in which expression of NAAP may be correlated with disease. The 
diagnostic assay may be used to determine absence, presence, and excess expression of NAAP, and 
to monitor regulation of NAAP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotides,
including genomic sequences, encoding NAAP or closely related molecules may be used to identify
nucleic acid sequences which encode NAAP. The specificity of the probe, whether it is made from a
highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a conserved
motif, and the stringency of the hybridization or amplification will determine whether the probe
identifies only naturally occurring sequences encoding NAAP, allelic variants, or related sequences.

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the NAAP encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:58-114 or
from genomic sequences including promoters, enhancers, and introns of the NAAP gene. 
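
The 50% sequence identity criterion for probes can be illustrated with a short calculation. The sketch below assumes the two sequences have already been aligned to equal length without gaps, which is a simplification; the function name and the example sequences are hypothetical and are not taken from the sequence listing.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of positions with identical bases in two pre-aligned, equal-length sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


if __name__ == "__main__":
    # Hypothetical probe and target fragments, for illustration only.
    identity = percent_identity("ACGTACGT", "ACGTTCGA")
    print(identity, identity >= 50.0)
```

A real comparison would first align the sequences with a standard alignment tool and account for gaps before computing identity.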

Means for producing specific hybridization probes for polynucleotides encoding NAAP include
the cloning of polynucleotides encoding NAAP or NAAP derivatives into vectors for the production of
mRNA probes. Such vectors are known in the art, are commercially available, and may be used to 
synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerases and 
the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter
groups, for example, by radionuclides such as 32P or 35S, or by enzymatic labels, such as alkaline
phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotides encoding NAAP may be used for the diagnosis of disorders associated with 
expression of NAAP. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia
vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, leukemia,
lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, a cancer of the adrenal
gland, bladder, bone, bone marrow, brain, breast, cervix, colon, gall bladder, ganglia, gastrointestinal
tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands,
skin, spleen, testis, thymus, thyroid, and uterus; a neurological disorder such as epilepsy, ischemic
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease,
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic
lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis 
pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral 
meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases including
kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal familial
insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous 
sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation 
and other developmental disorder of the central nervous system, cerebral palsy, a neuroskeletal 
disorder, an autonomic nervous system disorder, a cranial nerve disorder, a spinal cord disease,
muscular dystrophy and other neuromuscular disorder, a peripheral nervous system disorder,
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathy, myasthenia
gravis, periodic paralysis, a mental disorder including mood, anxiety, and schizophrenic disorder,
seasonal affective disorder (SAD), akathisia, amnesia, catatonia, diabetic neuropathy, tardive
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, and Tourette's disorder; a
developmental disorder such as renal tubular acidosis, anemia, Cushing's syndrome, achondroplastic
dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome 
(Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation), Smith-Magenis syndrome, 
myelodysplastic syndrome, hereditary myoepithelial dysplasia, hereditary keratodermas, hereditary 
neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism,
hydrocephalus, seizure disorders such as Sydenham's chorea and cerebral palsy, spina bifida,
anencephaly, craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing loss; an
autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AIDS), Addison's
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia,
asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic 
lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis,
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or
pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic
lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial,
fungal, parasitic, protozoal, and helminthic infections, and trauma; and an infection, such as those
caused by a viral agent classified as adenovirus, arenavirus, bunyavirus, calicivirus, coronavirus,
filovirus, hepadnavirus, herpesvirus, flavivirus, orthomyxovirus, parvovirus, papovavirus,
paramyxovirus, picornavirus, poxvirus, reovirus, retrovirus, rhabdovirus, or togavirus; an infection
caused by a bacterial agent classified as pneumococcus, staphylococcus, streptococcus, bacillus,
corynebacterium, Clostridium, meningococcus, gonococcus, listeria, moraxella, kingella, haemophilus,
legionella, bordetella, gram-negative enterobacterium including shigella, salmonella, or Campylobacter,
pseudomonas, vibrio, brucella, francisella, yersinia, bartonella, nocardia, actinomyces,
mycobacterium, spirochaetale, rickettsia, chlamydia, or mycoplasma; an infection caused by a fungal
agent classified as aspergillus, blastomyces, dermatophytes, cryptococcus, coccidioides, malassezia,
histoplasma, or other mycosis-causing fungal agent; and an infection caused by a parasite classified as
Plasmodium or malaria-causing, parasitic entamoeba, leishmania, trypanosoma, toxoplasma,
Pneumocystis carinii, intestinal protozoa such as giardia, trichomonas, tissue nematode such as
trichinella, intestinal nematode such as ascaris, lymphatic filarial nematode, trematode such as
schistosoma, and cestode such as tapeworm. Polynucleotides encoding NAAP may be used in
Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; in
dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from
patients to detect altered NAAP expression. Such qualitative or quantitative methods are well known
in the art.

In a particular embodiment, polynucleotides encoding NAAP may be used in assays that 
detect the presence of associated disorders, particularly those mentioned above. Polynucleotides 
complementary to sequences encoding NAAP may be labeled by standard methods and added to a 
fluid or tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantified and
compared with a standard value. If the amount of signal in the patient sample is significantly altered in
comparison to a control sample, then the presence of altered levels of polynucleotides encoding NAAP
in the sample indicates the presence of the associated disorder. Such assays may also be used to
evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, 
or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of
NAAP, a normal or standard profile for expression is established. This may be accomplished by
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding NAAP, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from normal 
subjects with values from an experiment in which a known amount of a substantially purified
polynucleotide is used. Standard values obtained in this manner may be compared with values
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 
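
One way to express deviation from standard values numerically is as a standardized score relative to the normal-subject measurements. The sketch below is illustrative only: the use of a z-score, the cutoff of two standard deviations, and the function names are assumptions and are not prescribed by the text.

```python
from statistics import mean, stdev


def deviation_from_standard(patient_value, normal_values):
    """Number of standard deviations separating a patient value from the normal mean."""
    sigma = stdev(normal_values)
    if sigma == 0:
        raise ValueError("normal values show no variation")
    return (patient_value - mean(normal_values)) / sigma


def is_altered(patient_value, normal_values, cutoff=2.0):
    """Flag a patient value as altered; `cutoff` is a hypothetical threshold."""
    return abs(deviation_from_standard(patient_value, normal_values)) > cutoff


if __name__ == "__main__":
    # Hypothetical hybridization signals from normal subjects (arbitrary units).
    normal = [98, 100, 102, 100, 100]
    print(is_altered(110, normal), is_altered(101, normal))
```

The same comparison can be repeated over successive assays to follow a patient during treatment.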

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the
patient begins to approximate that which is observed in the normal subject. The results obtained from
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development
of the disease, or may provide a means for detecting the disease prior to the appearance of actual
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier, thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding NAAP
may involve the use of PCR. These oligomers may be chemically synthesized, generated
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide
encoding NAAP, or a fragment of a polynucleotide complementary to the polynucleotide encoding 
NAAP, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or
quantification of closely related DNA or RNA sequences.

In a particular aspect, oligonucleotide primers derived from polynucleotides encoding NAAP 
may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions and
deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods of
SNP detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) and
fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from polynucleotides
encoding NAAP are used to amplify DNA using the polymerase chain reaction (PCR). The DNA
may be derived, for example, from diseased or normal tissue, biopsy samples, bodily fluids, and the
like. SNPs in the DNA cause differences in the secondary and tertiary structures of PCR products in 
single-stranded form, and these differences are detectable using gel electrophoresis in non-denaturing 
gels. In fSSCP, the oligonucleotide primers are fluorescently labeled, which allows detection of the
amplimers in high-throughput equipment such as DNA sequencing machines. Additionally, sequence
database analysis methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by
comparing the sequence of individual overlapping DNA fragments which assemble into a common 
consensus sequence. These computer-based methods filter out sequence variations due to laboratory 
preparation of DNA and sequencing errors using statistical models and automated analyses of DNA 
sequence chromatograms. In the alternative, SNPs may be detected and characterized by mass
spectrometry using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San
Diego CA). 
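
The in silico approach of comparing overlapping fragments that assemble into a common consensus can be sketched as a column-by-column tally. This is a simplified stand-in: the reads are assumed to be pre-aligned to equal length, and a minimum read count per base (a hypothetical parameter) replaces the statistical error models and chromatogram analysis mentioned above.

```python
from collections import Counter


def candidate_snps(aligned_reads, min_count=2):
    """Return {position: {base: count}} for columns with more than one well-supported base.

    `aligned_reads` are equal-length strings; '-' marks a gap.
    `min_count` is a hypothetical filter against isolated sequencing errors.
    """
    if not aligned_reads:
        raise ValueError("no reads supplied")
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("reads must be aligned to equal length")
    snps = {}
    for pos in range(length):
        counts = Counter(read[pos] for read in aligned_reads if read[pos] != "-")
        supported = {base: n for base, n in counts.items() if n >= min_count}
        if len(supported) > 1:
            snps[pos] = supported
    return snps


if __name__ == "__main__":
    # Hypothetical aligned fragments, for illustration only.
    reads = ["ACGTA", "ACGTA", "ACCTA", "ACCTA", "ACGTT"]
    print(candidate_snps(reads))
```

In this example the single T at the last position is treated as a likely sequencing error, while the G/C difference at the third position is reported as a candidate polymorphism.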

SNPs may be used to study the genetic basis of human disease. For example, at least 16 
common SNPs have been associated with non-insulin-dependent diabetes mellitus. SNPs are also 
useful for examining differences in disease outcomes in monogenic disorders, such as cystic fibrosis,
sickle cell anemia, or chronic granulomatous disease. For example, variants in the mannose-binding
lectin, MBL2, have been shown to be correlated with deleterious pulmonary outcomes in cystic 
fibrosis. SNPs also have utility in pharmacogenomics, the identification of genetic variants that 
influence a patient's response to a drug, such as life-threatening toxicity. For example, a variation in 
N-acetyl transferase is associated with a high incidence of peripheral neuropathy in response to the
anti-tuberculosis drug isoniazid, while a variation in the core promoter of the ALOX5 gene results in
diminished clinical response to treatment with an anti-asthma drug that targets the 5-lipoxygenase 
pathway. Analysis of the distribution of SNPs in different populations is useful for investigating 
genetic drift, mutation, recombination, and selection, as well as for tracing the origins of populations 
and their migrations (Taylor, J.G. et al. (2001) Trends Mol. Med. 7:507-512; Kwok, P.-Y. and Z. Gu
(1999) Mol. Med. Today 5:538-543; Nowotny, P. et al. (2001) Curr. Opin. Neurobiol. 11:637-641).

Methods which may also be used to quantify the expression of NAAP include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves (Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et al. (1993)
Anal. Biochem. 212:229-236). The speed of quantitation of multiple samples may be accelerated by
running the assay in a high-throughput format where the oligomer or polynucleotide of interest is
presented in various dilutions and a spectrophotometric or colorimetric response gives rapid
quantitation.
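
Interpolating a result from a standard curve can be illustrated as follows. The sketch assumes piecewise-linear interpolation between dilution standards; the function name and the standard values are hypothetical and are not taken from the cited references.

```python
def interpolate_from_standard_curve(signal, standards):
    """Estimate an amount from a measured signal by linear interpolation.

    `standards` is a list of (known_amount, measured_signal) pairs.
    """
    points = sorted(standards, key=lambda pair: pair[1])
    for (a_lo, s_lo), (a_hi, s_hi) in zip(points, points[1:]):
        if s_lo <= signal <= s_hi and s_hi > s_lo:
            t = (signal - s_lo) / (s_hi - s_lo)
            return a_lo + t * (a_hi - a_lo)
    raise ValueError("signal lies outside the range of the standard curve")


if __name__ == "__main__":
    # Hypothetical dilution series: (amount, absorbance).
    curve = [(0, 0.0), (10, 0.5), (20, 1.0)]
    print(interpolate_from_standard_curve(0.75, curve))
```

Signals outside the range of the standards are rejected rather than extrapolated, since a colorimetric response is generally reliable only within the calibrated range.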

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotides described herein may be used as elements on a microarray. The microarray can be
used in transcript imaging techniques which monitor the relative expression levels of large numbers of
genes simultaneously as described below. The microarray may also be used to identify genetic
variants, mutations, and polymorphisms. This information may be used to determine gene function, to
understand the genetic basis of a disorder, to diagnose a disorder, to monitor progression/regression of 
disease as a function of gene expression, and to develop and monitor the activities of therapeutic 
agents in the treatment of disease. In particular, this information may be used to develop a 
pharmacogenomic profile of a patient in order to select the most appropriate and effective treatment
regimen for that patient. For example, therapeutic agents which are highly effective and display the
fewest side effects may be selected for a patient based on his/her pharmacogenomic profile.

In another embodiment, NAAP, fragments of NAAP, or antibodies specific for NAAP may 
be used as elements on a microarray. The microarray may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at 
a given time (Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent No. 5,840,484;
hereby expressly incorporated by reference herein). Thus a transcript image may be generated by
hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The
resultant transcript image would provide a profile of gene activity.
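
The idea of a transcript image as the number of expressed genes together with their relative abundance can be sketched with a simple normalization. The function name, the gene labels, and the counts below are hypothetical; the sketch is not the method of the cited patent.

```python
def transcript_image(counts):
    """Convert raw per-gene signal or counts into relative abundances that sum to 1."""
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("no expressed transcripts detected")
    return {gene: n / total for gene, n in counts.items() if n > 0}


if __name__ == "__main__":
    # Hypothetical hybridization counts for three array elements.
    image = transcript_image({"gene_a": 30, "gene_b": 10, "gene_c": 0})
    print(len(image), image)
```

Elements with no detected signal are omitted, so the size of the result gives the number of expressed genes and the values give their relative abundance.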

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in vivo, as in the 
case of a tissue or biopsy sample, or in vitro, as in the case of a cell line.

Transcript images which profile the expression of the polynucleotides of the present invention 
may also be used in conjunction with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental
compounds. All compounds induce characteristic gene expression patterns, frequently termed
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity
(Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson (2000) 
Toxicol. Lett. 112-113:467-471). If a test compound has a signature similar to that of a compound 
with known toxicity, it is likely to share those toxic properties. These fingerprints or signatures are
most useful and refined when they contain expression information from a large number of genes and
gene families. Ideally, a genome-wide measurement of expression provides the highest quality
signature. Genes whose expression is not altered by any tested compound are also important,
as the levels of expression of these genes are used to normalize the rest of the expression data.
The normalization procedure is useful for comparison of expression data after treatment with different
compounds. While the assignment of gene function to elements of a toxicant signature aids in
interpretation of toxicity mechanisms, knowledge of gene function is not necessary for the statistical
matching of signatures which leads to prediction of toxicity (see, for example, Press Release 00-02 
from the National Institute of Environmental Health Sciences, released February 29, 2000, available at 
http://www.niehs.nih.gov/oc/news/toxchip.htm). Therefore, it is important and desirable in
toxicological screening using toxicant signatures to include all expressed gene sequences.
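
The normalization and statistical matching of signatures can be illustrated with a small sketch. It assumes that normalization divides each expression level by the mean level of unaltered reference genes and that similarity is measured by Pearson correlation; both choices, along with all names and values, are assumptions made for illustration and are not specified in the text.

```python
import math


def normalize(expression, reference_genes):
    """Scale expression levels by the mean level of genes assumed to be unaltered."""
    reference_level = sum(expression[g] for g in reference_genes) / len(reference_genes)
    if reference_level <= 0:
        raise ValueError("reference level must be positive")
    return {gene: level / reference_level for gene, level in expression.items()}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need two equal-length sequences with at least two values")
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("a signature with no variation cannot be correlated")
    return cov / math.sqrt(vx * vy)


def signature_similarity(test_signature, known_signature):
    """Correlate two signatures over the genes they have in common."""
    shared = sorted(set(test_signature) & set(known_signature))
    return pearson([test_signature[g] for g in shared],
                   [known_signature[g] for g in shared])


if __name__ == "__main__":
    # Hypothetical normalized signatures for a test compound and a known toxicant.
    test = {"g1": 2.0, "g2": 4.0, "g3": 6.0}
    known = {"g1": 1.0, "g2": 2.0, "g3": 3.0}
    print(signature_similarity(test, known))
```

Note that the matching step uses only expression values, consistent with the point that knowledge of gene function is not required for it.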

In an embodiment, the toxicity of a test compound can be assessed by treating a biological 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present invention
may be quantified. The transcript levels in the treated biological sample are compared with levels in
an untreated biological sample. Differences in the transcript levels between the two samples are 
indicative of a toxic response caused by the test compound in the treated sample. 

Another embodiment relates to the use of the polypeptides disclosed herein to analyze the 
proteome of a tissue or cell type. The term proteome refers to the global pattern of protein expression 

in a particular tissue or cell type. Each protein component of a proteome can be subjected individually
to further analysis. Proteome expression patterns, or profiles, are analyzed by quantifying the number 
of expressed proteins and their relative abundance under given conditions and at a given time. A 
profile of a cell's proteome may thus be generated by separating and analyzing the polypeptides of a
particular tissue or cell type. In one embodiment, the separation is achieved using two-dimensional gel 
electrophoresis, in which proteins from a sample are separated by isoelectric focusing in the first 
dimension, and then according to molecular weight by sodium dodecyl sulfate slab gel electrophoresis 

in the second dimension (Steiner and Anderson, supra). The proteins are visualized in the gel as
discrete and uniquely positioned spots, typically by staining the gel with an agent such as Coomassie 
Blue or silver or fluorescent stains. The optical density of each protein spot is generally proportional to 
the level of the protein in the sample. The optical densities of equivalently positioned protein spots 
from different samples, for example, from biological samples either treated or untreated with a test 

compound or therapeutic agent, are compared to identify any changes in protein spot density related to
the treatment. The proteins in the spots are partially sequenced using, for example, standard methods 
employing chemical or enzymatic cleavage followed by mass spectrometry. The identity of the protein 
in a spot may be determined by comparing its partial sequence, preferably of at least 5 contiguous 
amino acid residues, to the polypeptide sequences of interest. In some cases, further sequence data 

may be obtained for definitive protein identification.

A proteomic profile may also be generated using antibodies specific for NAAP to quantify the 
levels of NAAP expression. In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing the microarray to the sample and detecting 
the levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-111;
Mendoza, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a

variety of methods known in the art, for example, by reacting the proteins in the sample with a thiol- or 
amino-reactive fluorescent compound and detecting the amount of fluorescence bound at each array 
element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and 
should be analyzed in parallel with toxicant signatures at the transcript level. There is a poor

correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be
useful in the analysis of compounds which do not significantly affect the transcript image, but which 
alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases.

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated 






biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to the amount of the corresponding protein in an untreated biological sample. 
A difference in the amount of protein between the two samples is indicative of a toxic response to the 
test compound in the treated sample. Individual proteins are identified by sequencing the amino acid 
residues of the individual proteins and comparing these partial sequences to the polypeptides of the
present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 

by the antibodies is quantified. The amount of protein in the treated biological sample is compared
with the amount in an untreated biological sample. A difference in the amount of protein between the 
two samples is indicative of a toxic response to the test compound in the treated sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art (Brennan, 
T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. USA 

93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon, D. et al. (1995)
PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150-2155;
Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662). Various types of microarrays are well known
and thoroughly described in Schena, M., ed. (1999; DNA Microarrays: A Practical Approach, Oxford
University Press, London).

In another embodiment of the invention, nucleic acid sequences encoding NAAP may be used
to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be
preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 

mapping. The sequences may be mapped to a particular chromosome, to a specific region of a

chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1
constructions, or single chromosome cDNA libraries (Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355;
Price, C.M. (1993) Blood Rev. 7:127-134; Trask, B.J. (1991) Trends Genet. 7:149-154). Once
mapped, the nucleic acid sequences may be used to develop genetic linkage maps, for example, which
correlate the inheritance of a disease state with the inheritance of a particular chromosome region or 
restriction fragment length polymorphism (RFLP) (Lander, E.S. and D. Botstein (1986) Proc. Natl. 
Acad. Sci. USA 83:7353-7357).

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
map data (Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968). Examples of genetic map data 
can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) 

World Wide Web site. Correlation between the location of the gene encoding NAAP on a physical
map and a specific disorder, or a predisposition to a specific disorder, may help define the region of 
DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 

Often the placement of a gene on the chromosome of another mammalian species, such as mouse,
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 
localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 11q22-23, any
sequences mapping to that area may represent associated or regulatory genes for further investigation
(Gatti, R.A. et al. (1988) Nature 336:577-580). The nucleotide sequence of the instant invention may 
also be used to detect differences in the chromosomal location due to translocation, inversion, etc., 
among normal, carrier, or affected individuals. 

In another embodiment of the invention, NAAP, its catalytic or immunogenic fragments, or 

oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug
screening techniques. The fragment employed in such screening may be free in solution, affixed to a
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes
between NAAP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 

having suitable binding affinity to the protein of interest (Geysen, et al. (1984) PCT application

WO84/03564). In this method, large numbers of different small test compounds are synthesized on a 
solid substrate. The test compounds are reacted with NAAP, or fragments thereof, and washed. 
Bound NAAP is then detected by methods well known in the art. Purified NAAP can also be coated 
directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 

non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support.

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding NAAP specifically compete with a test compound for binding NAAP. 






In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with NAAP. 

In additional embodiments, the nucleotide sequences which encode NAAP may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following embodiments are, therefore, 
to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way 
whatsoever.

The disclosures of all patents, applications, and publications mentioned above and below, 
including U.S. Ser. No. 60/343,004, U.S. Ser. No. 60/347,633, U.S. Ser. No. 60/359,498, and U.S. Ser. 
No. 60/351,749, are hereby expressly incorporated by reference. 

EXAMPLES
I. Construction of cDNA Libraries 

Incyte cDNAs were derived from cDNA libraries described in the LIFESEQ GOLD
database (Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in guanidinium 
isothiocyanate, while others were homogenized and lysed in phenol or in a suitable mixture of 

denaturants, such as TRIZOL (Invitrogen), a monophasic solution of phenol and guanidine
isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted with 
chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and 
ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 

purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 
(QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA

libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Invitrogen), using the 
recommended procedures or similar methods known in the art (Ausubel et al., supra, ch. 5). Reverse
transcription was initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were
ligated to double stranded cDNA, and the cDNA was digested with the appropriate restriction enzyme
or enzymes. For most libraries, the cDNA was size-selected (300-1000 bp) using SEPHACRYL
S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography (Amersham
Biosciences) or preparative agarose gel electrophoresis. cDNAs were ligated into compatible
restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid
(Stratagene), PSPORT1 plasmid (Invitrogen, Carlsbad CA), PCDNA2.1 plasmid (Invitrogen), PBK-CMV
plasmid (Stratagene), PCR2-TOPOTA plasmid (Invitrogen), PCMV-ICIS plasmid (Stratagene),
pIGEN (Incyte Genomics, Palo Alto CA), pRARE (Incyte Genomics), or pINCY (Incyte Genomics),
or derivatives thereof. Recombinant plasmids were transformed into competent E. coli cells including
XL1-Blue, XL1-BlueMRF, or SOLR from Stratagene or DH5α, DH10B, or ElectroMAX DH10B
from Invitrogen.

II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid,
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP
96 plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1
ml of distilled water and stored, with or without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 

384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 

III. Sequencing and Analysis 

Incyte cDNAs recovered in plasmids as described in Example II were sequenced as follows.
Sequencing reactions were processed using standard methods or high-throughput instrumentation such
as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler
(MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared
using reagents provided by Amersham Biosciences or supplied in ABI sequencing kits such as the 
ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 

were carried out using the MEGABACE 1000 DNA sequencing system (Amersham Biosciences);
the ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard
ABI protocols and base calling software; or other sequence analysis systems known in the art.
Reading frames within the cDNA sequences were identified using standard methods (Ausubel et al., 
supra, ch. 7). Some of the cDNA sequences were selected for extension using the techniques 

disclosed in Example VIII.

The polynucleotide sequences derived from Incyte cDNAs were validated by removing 
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 
programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The 
Incyte cDNA sequences or translations thereof were then queried against a selection of public 

databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and
BLOCKS, PRINTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo 
sapiens, Rattus norvegicus, Mus musculus, Caenorhabditis elegans, Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, and Candida albicans (Incyte Genomics, Palo Alto CA); hidden 
Markov model (HMM)-based protein family databases such as PFAM, INCY, and TIGRFAM (Haft,
D.H. et al. (2001) Nucleic Acids Res. 29:41-43); and HMM-based protein domain databases such as
SMART (Schultz, J. et al. (1998) Proc. Natl. Acad. Sci. USA 95:5857-5864; Letunic, I. et al. (2002) 
Nucleic Acids Res. 30:242-244). (HMM is a probabilistic approach which analyzes consensus 
primary structures of gene families; see, for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 
6:361-365.) The queries were performed using programs based on BLAST, FASTA, BLIMPS, and 

HMMER. The Incyte cDNA sequences were assembled to produce full length polynucleotide
sequences. Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched 
sequences, or Genscan-predicted coding sequences (see Examples IV and V) were used to extend 
Incyte cDNA assemblages to full length. Assembly was performed using programs based on Phred, 
Phrap, and Consed, and cDNA assemblages were screened for open reading frames using programs
based on GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated
to derive the corresponding full length polypeptide sequences. Alternatively, a polypeptide may begin 
at any of the methionine residues of the full length translated polypeptide. Full length polypeptide 
sequences were subsequently analyzed by querying against databases such as the GenBank protein
databases (genpept), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO, 
PRODOM, Prosite, hidden Markov model (HMM)-based protein family databases such as PFAM, 
INCY, and TIGRFAM; and HMM-based protein domain databases such as SMART. Full length 

polynucleotide sequences are also analyzed using MACDNASIS PRO software (MiraiBio, Alameda
CA) and LASERGENE software (DNASTAR). Polynucleotide and polypeptide sequence alignments 
are generated using default parameters specified by the CLUSTAL algorithm as incorporated into the 
MEGALIGN multisequence alignment program (DNASTAR), which also calculates the percent 
identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of

Incyte cDNA and full length sequences and provides applicable descriptions, references, and threshold 
parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second 
column provides brief descriptions thereof, the third column presents appropriate references, all of 
which are incorporated by reference herein in their entirety, and the fourth column presents, where 

applicable, the scores, probability values, and other parameters used to evaluate the strength of a
match between two sequences (the higher the score or the lower the probability value, the greater the 
identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide and 
polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 

NO:58-114. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and
amplification technologies are described in Table 4, column 2.

IV. Identification and Editing of Coding Sequences from Genomic DNA

Putative nucleic acid-associated proteins were initially identified by running the Genscan gene 
identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is 

a general-purpose gene identification program which analyzes genomic DNA sequences from a
variety of organisms (Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94; Burge, C. and S. Karlin
(1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to form an 
assembled cDNA sequence extending from a methionine to a stop codon. The output of Genscan is a 
FASTA database of polynucleotide and polypeptide sequences. The maximum range of sequence for 

Genscan to analyze at once was set to 30 kb. To determine which of these Genscan predicted cDNA
sequences encode nucleic acid-associated proteins, the encoded polypeptides were analyzed by 
querying against PFAM models for nucleic acid-associated proteins. Potential nucleic acid-associated 
proteins were also identified by homology to Incyte cDNA sequences that had been annotated as
nucleic acid-associated proteins. These selected Genscan-predicted sequences were then compared 
by BLAST analysis to the genpept and gbpri public databases. Where necessary, the Genscan- 
predicted sequences were then edited by comparison to the top BLAST hit from genpept to correct 

errors in the sequence predicted by Genscan, such as extra or omitted exons. BLAST analysis was
also used to find any Incyte cDNA or public cDNA coverage of the Genscan-predicted sequences, 
thus providing evidence for transcription. When Incyte cDNA coverage was available, this 
information was used to correct or confirm the Genscan predicted sequence. Full length 
polynucleotide sequences were obtained by assembling Genscan-predicted coding sequences with 

Incyte cDNA sequences and/or public cDNA sequences using the assembly process described in
Example III. Alternatively, full length polynucleotide sequences were derived entirely from edited or
unedited Genscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene

identification program described in Example IV. Partial cDNAs assembled as described in Example 
III were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm 
based on graph theory and dynamic programming to integrate cDNA and genomic information, 

generating possible splice variants that were subsequently confirmed, edited, or extended to create a
full length sequence. Sequence intervals in which the entire length of the interval was present on 
more than one sequence in the cluster were identified, and intervals thus identified were considered to 
be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic 
sequences, then all three intervals were considered to be equivalent. This process allows unrelated
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals
thus identified were then "stitched" together by the stitching algorithm in the order that they appear 
along their parent sequences to generate the longest possible sequence, as well as sequence variants. 
Linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or 
genomic sequence to genomic sequence) were given preference over linkages which change parent 

type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared
by BLAST analysis to the genpept and gbpri public databases. Incorrect exons predicted by Genscan 
were corrected by comparison to the top BLAST hit from genpept. Sequences were further extended 
with additional cDNA sequences, or by inspection of genomic DNA, when necessary.
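
The grouping of intervals that are "equivalent by transitivity" can be illustrated with a small sketch. The disclosure states only that the algorithm is based on graph theory and dynamic programming; the union-find structure used here is a substituted, standard technique chosen for illustration, and the interval labels are hypothetical.

```python
def group_equivalent(pairs):
    """Group interval labels into equivalence classes from pairwise matches.

    pairs is an iterable of (label_a, label_b) tuples, each meaning that the
    same interval was found on two parent sequences. Labels are hypothetical.
    """
    parent = {}

    def find(x):
        # Locate the representative of x, shortening the path as we go.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a  # merge the two classes

    classes = {}
    for label in parent:
        classes.setdefault(find(label), set()).add(label)
    return sorted(sorted(members) for members in classes.values())

# One cDNA interval matched on two genomic sequences, plus an unrelated match.
pairs = [
    ("cDNA1:10-200", "genomicA:5000-5190"),
    ("cDNA1:10-200", "genomicB:120-310"),
    ("cDNA2:1-90", "genomicB:400-489"),
]
groups = group_equivalent(pairs)
```

In this example the two genomic intervals are never compared directly, yet they fall into one class because both match the same cDNA interval, which is the situation described in the text for a cDNA and two genomic sequences.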
"Stretched" Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 
analysis. First, partial cDNAs assembled as described in Example III were queried against public
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 
(HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 

may occur in the chimeric protein with respect to the original GenBank protein homolog. The

GenBank protein homolog, the chimeric protein, or both were used as probes to search for homologous 
genomic sequences from the public human genome databases. Partial DNA sequences were 
therefore "stretched" or extended by the addition of homologous genomic sequences. The resultant 
stretched sequences were examined to determine whether they contained a complete gene.

VI. Chromosomal Mapping of NAAP Encoding Polynucleotides

The sequences which were used to assemble SEQ ID NO:58-114 were compared with
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched
SEQ ID NO:58-114 were assembled into clusters of contiguous and overlapping sequences using
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for
Genome Research (WIGR), and Généthon were used to determine if any of the clustered sequences
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p-arm.
(The centiMorgan (cM) is a unit of measurement based on recombination frequencies between
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in
humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances
are based on genetic markers mapped by Généthon which provide boundaries for radiation hybrid
markers whose sequences were included in each of the clusters. Human genome maps and other 
resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 
(http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified disease
genes map within or in proximity to the intervals indicated above.

VII. Analysis of Polynucleotide Expression

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs
from a particular cell type or tissue have been bound (Sambrook and Russell, supra, ch. 7; Ausubel et 
al., supra, ch. 4). 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is much 
faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer search
can be modified to determine whether any particular match is categorized as exact or similar. The 
basis of the search is the product score, which is defined as: 

Product Score = (BLAST Score x Percent Identity) / (5 x minimum {length(Seq. 1), length(Seq. 2)})

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 

product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate 
the product score. The product score represents a balance between fractional overlap and quality in a 

BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap. 
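
The product score calculation described above can be worked through in a short sketch. The function names are hypothetical, and the simplified BLAST score below applies only the +5/-4 scoring stated in the text to a single ungapped HSP; it is not a reimplementation of BLAST.

```python
def blast_score(matches, mismatches):
    """Score one ungapped HSP: +5 per matching base, -4 per mismatch."""
    return 5 * matches - 4 * mismatches

def product_score(score, percent_identity, len_seq1, len_seq2):
    """BLAST score x percent identity / (5 x length of the shorter sequence)."""
    return score * percent_identity / (5 * min(len_seq1, len_seq2))

# 100% identity over the full length of the shorter (100-base) sequence.
full = product_score(blast_score(100, 0), 100, 100, 200)

# 100% identity over 70% of the shorter sequence.
partial_overlap = product_score(blast_score(70, 0), 100, 100, 200)

# 88% identity over the full length of the shorter sequence.
partial_identity = product_score(blast_score(88, 12), 88, 100, 200)
```

Under these assumptions the first two cases give exactly 100 and 70, and the third gives a value just under 69, which is consistent with the approximate figure of 70 quoted in the text for 88% identity and 100% overlap.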

Alternatively, polynucleotides encoding NAAP are analyzed with respect to the tissue sources

from which they were derived. For example, some full length sequences are assembled, at least in 
part, with overlapping Incyte cDNA sequences (see Example III). Each cDNA sequence is derived 
from a cDNA library constructed from a human tissue. Each human tissue is classified into one of the
following organ/tissue categories: cardiovascular system; connective tissue; digestive system;
embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cells;
hemic and immune system; liver; musculoskeletal system; nervous system; pancreas; respiratory
system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract. The number
of libraries in each category is counted and divided by the total number of libraries across all 
categories. Similarly, each human tissue is classified into one of the following disease/condition 
categories: cancer, cell line, developmental, inflammation, neurological, trauma, cardiovascular, pooled, 
and other, and the number of libraries in each category is counted and divided by the total number of 

libraries across all categories. The resulting percentages reflect the tissue- and disease-specific
expression of cDNA encoding NAAP. cDNA sequences and cDNA library/tissue information are
found in the LIFESEQ GOLD database (Incyte Genomics, Palo Alto CA).
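
The per-category percentages described above amount to counting libraries in each category and dividing by the total. A minimal sketch follows; the function name and the example list of library categories are hypothetical and do not represent data from the LIFESEQ GOLD database.

```python
from collections import Counter

def category_percentages(library_categories):
    """Return {category: percent of libraries} for a list of per-library categories."""
    counts = Counter(library_categories)
    total = sum(counts.values())
    # An empty input yields an empty result, so no division by zero occurs.
    return {category: 100.0 * n / total for category, n in counts.items()}

# Hypothetical organ/tissue category assigned to each of four cDNA libraries.
libraries = ["nervous system", "nervous system", "liver", "digestive system"]
percentages = category_percentages(libraries)
```

The same function can be applied unchanged to the disease/condition categories, since only the category labels differ.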

VIII. Extension of NAAP Encoding Polynucleotides

Full length polynucleotides are produced by extension of an appropriate fragment of the full 

length molecule using oligonucleotide primers designed from this fragment. One primer was
synthesized to initiate 5' extension of the known fragment, and the other primer was synthesized to
initiate 3' extension of the known fragment. The initial primers were designed using OLIGO 4.06
software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the target sequence at 

temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would result in hairpin
structures and primer-primer dimerizations was avoided. 
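
Two of the primer design criteria above, length and GC content, are simple enough to check programmatically. The sketch below is illustrative only: it does not reproduce OLIGO 4.06, the function names and example sequences are hypothetical, and it does not evaluate annealing temperature, hairpins, or primer-primer dimerization.

```python
def gc_percent(primer):
    """Percentage of G and C bases in a non-empty primer sequence."""
    seq = primer.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

def meets_length_and_gc(primer, min_len=22, max_len=30, min_gc=50.0):
    """Check the 'about 22 to 30 nucleotides' and 'GC content of about 50% or more' criteria."""
    # The length test runs first, so an empty string never reaches gc_percent.
    return min_len <= len(primer) <= max_len and gc_percent(primer) >= min_gc

# Hypothetical candidate sequences.
balanced = "GCGCATATGCGCATATGCGCATAT"   # 24 nt, 50% GC
at_rich = "ATATATATATATATATATATATAT"    # 24 nt, 0% GC
too_short = "GCGC"                      # 4 nt

balanced_ok = meets_length_and_gc(balanced)
at_rich_ok = meets_length_and_gc(at_rich)
too_short_ok = meets_length_and_gc(too_short)
```

Because the text gives the limits as approximate ("about"), the hard cutoffs used here are a simplification; a practical screen would likely allow some tolerance around each bound.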

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+, (NH4)2SO4,
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Biosciences), ELONGASE enzyme
(Invitrogen), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair
PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times;



Step 6: 68°C, 5 min; Step 7: storage at 4°C.
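
For illustration, the two cycling programs above may be written as data and their programmed hold times summed, as in the following Python sketch. The class and variable names are hypothetical. The sketch assumes that "repeated 20 times" means one initial pass through Steps 2 to 4 followed by 20 repeats (21 passes in total), and it ignores ramp times and the final 4°C storage step.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Hold:
    temp_c: float   # block temperature in degrees Celsius
    seconds: int    # programmed hold time

def total_hold_seconds(initial, cycle, repeats, final):
    """Sum of programmed holds; the cycle runs once plus `repeats` more times."""
    passes = 1 + repeats
    return initial.seconds + passes * sum(h.seconds for h in cycle) + final.seconds

# Primer pair PCI A and PCI B: 60 degree annealing step.
PCI_PROGRAM = dict(
    initial=Hold(94, 180),
    cycle=(Hold(94, 15), Hold(60, 60), Hold(68, 120)),
    repeats=20,
    final=Hold(68, 300),
)

# Primer pair T7 and SK+: identical except for a 57 degree annealing step.
T7_SK_PROGRAM = dict(
    initial=Hold(94, 180),
    cycle=(Hold(94, 15), Hold(57, 60), Hold(68, 120)),
    repeats=20,
    final=Hold(68, 300),
)

pci_seconds = total_hold_seconds(**PCI_PROGRAM)
t7_sk_seconds = total_hold_seconds(**T7_SK_PROGRAM)
```

Under these assumptions both programs total 4575 seconds (about 76 minutes) of hold time, since they differ only in annealing temperature.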

The concentration of DNA in each well was determined by dispensing 100 µl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 µl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar,
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the
concentration of DNA. A 5 µl to 10 µl aliquot of the reaction mixture was analyzed by
electrophoresis on a 1% agarose gel to determine which reactions were successful in extending the
sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC 18 vector (Amersham Biosciences). For shotgun
sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels,
fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham
Biosciences), treated with Pfu DNA polymerase (Stratagene) to fill in restriction site overhangs, and
transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing
media, and individual colonies were picked and cultured overnight at 37°C in 384-well plates in LB/2x
carb liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase
(Amersham Biosciences) and Pfu DNA polymerase (Stratagene) with the following parameters: Step
1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: Steps 2, 3, and
4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by
PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries
were reamplified using the same conditions as described above. Samples were diluted with 20%
dimethylsulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers
and the DYENAMIC DIRECT kit (Amersham Biosciences) or the ABI PRISM BIGDYE
Terminator cycle sequencing ready reaction kit (Applied Biosystems).

In like manner, full length polynucleotides are verified using the above procedure or are used
to obtain 5' regulatory sequences using the above procedure along with oligonucleotides designed for
such extension, and an appropriate genomic library.

IX. Identification of Single Nucleotide Polymorphisms in NAAP Encoding Polynucleotides

Common DNA sequence variants known as single nucleotide polymorphisms (SNPs) were
identified in SEQ ID NO:58-114 using the LIFESEQ database (Incyte Genomics). Sequences from
the same gene were clustered together and assembled as described in Example III, allowing the
identification of all sequence variants in the gene. An algorithm consisting of a series of filters was
used to distinguish SNPs from other sequence variants. Preliminary filters removed the majority of
basecall errors by requiring a minimum Phred quality score of 15, and removed sequence alignment
errors and errors resulting from improper trimming of vector sequences, chimeras, and splice variants.
An automated procedure of advanced chromosome analysis analysed the original chromatogram files
in the vicinity of the putative SNP. Clone error filters used statistically generated algorithms to identify
errors introduced during laboratory processing, such as those caused by reverse transcriptase,
polymerase, or somatic mutation. Clustering error filters used statistically generated algorithms to
identify errors resulting from clustering of close homologs or pseudogenes, or due to contamination by
non-human sequences. A final set of filters removed duplicates and SNPs found in immunoglobulins
or T-cell receptors.
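
The preliminary quality filter may be illustrated by the following Python sketch, which covers only the minimum Phred score of 15 and none of the later clone-error or clustering-error filters. The function names, the dictionary layout, and the example records are hypothetical. The conversion uses the standard Phred definition, Q = -10 log10(P), where P is the estimated probability of a basecall error.

```python
def phred_to_error_probability(q):
    """Estimated basecall error probability for Phred quality score q."""
    return 10 ** (-q / 10)

def passes_quality_filter(variant, min_phred=15):
    """Keep a candidate variant only if its basecall quality meets the minimum."""
    return variant["phred"] >= min_phred

# Hypothetical candidate variants (illustration only).
candidates = [
    {"position": 101, "phred": 32},
    {"position": 245, "phred": 9},
    {"position": 388, "phred": 15},
]
retained = [v for v in candidates if passes_quality_filter(v)]
```

Under this definition, a score of 15 corresponds to an estimated error probability of roughly 3%, so the filter discards basecalls that are less reliable than that.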

Certain SNPs were selected for further characterization by mass spectrometry using the high
throughput MASSARRAY system (Sequenom, Inc.) to analyze allele frequencies at the SNP sites in
four different human populations. The Caucasian population comprised 92 individuals (46 male, 46
female), including 83 from Utah, four French, three Venezuelan, and two Amish individuals. The
African population comprised 194 individuals (97 male, 97 female), all African Americans. The
Hispanic population comprised 324 individuals (162 male, 162 female), all Mexican Hispanic. The
Asian population comprised 126 individuals (64 male, 62 female) with a reported parental breakdown
of 43% Chinese, 31% Japanese, 13% Korean, 5% Vietnamese, and 8% other Asian. Allele
frequencies were first analyzed in the Caucasian population; in some cases those SNPs which showed
no allelic variance in this population were not further tested in the other three populations.
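
As an illustrative sketch only, allele frequencies at a biallelic SNP site may be computed from diploid genotype counts as shown below in Python. The function names and the genotype counts are hypothetical and are not data from the populations described above; only the population size of 92 is taken from the text. A site is treated as showing no allelic variance when one allele has a frequency of 1.

```python
def allele_frequencies(n_aa, n_ab, n_bb):
    """Frequencies of alleles A and B from counts of AA, AB, and BB genotypes."""
    chromosomes = 2 * (n_aa + n_ab + n_bb)
    if chromosomes == 0:
        raise ValueError("no genotyped individuals")
    freq_a = (2 * n_aa + n_ab) / chromosomes
    freq_b = (2 * n_bb + n_ab) / chromosomes
    return freq_a, freq_b

def shows_allelic_variance(n_aa, n_ab, n_bb):
    """True if both alleles are observed in the genotyped individuals."""
    freq_a, freq_b = allele_frequencies(n_aa, n_ab, n_bb)
    return freq_a > 0 and freq_b > 0

# Hypothetical genotype counts for a population of 92 individuals.
freq_a, freq_b = allele_frequencies(n_aa=50, n_ab=32, n_bb=10)
monomorphic_site_varies = shows_allelic_variance(n_aa=92, n_ab=0, n_bb=0)
```

A site for which the second call returns False would correspond to a SNP showing no allelic variance in the first population tested.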
X. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:58-114 are employed to screen cDNAs,
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base
pairs, is specifically described, essentially the same procedure is used with larger nucleotide
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of
[γ-32P] adenosine triphosphate (Amersham Biosciences), and T4 polynucleotide kinase (DuPont NEN,
Boston MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25
superfine size exclusion dextran bead column (Amersham Biosciences). An aliquot containing 10^7
counts per minute of the labeled probe is used in a typical membrane-based hybridization analysis of
human genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I,
Xba I, or Pvu II (DuPont NEN).

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature
under conditions of up to, for example, 0.1x saline sodium citrate and 0.5% sodium dodecyl sulfate.
Hybridization patterns are visualized using autoradiography or an alternative imaging means and
compared.

XI. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing
photolithography, piezoelectric printing (ink-jet printing; see, e.g., Baldeschweiler et al., supra),
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the
aforementioned technologies should be uniform and solid with a non-porous surface (Schena, M., ed.
(1999) DNA Microarrays: A Practical Approach, Oxford University Press, London). Suggested
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be
produced using available methods and machines well known to those of ordinary skill in the art and
may contain any appropriate number of elements (Schena, M. et al. (1995) Science 270:467-470;
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol.
16:27-31).

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be
selected using software well known in the art such as LASERGENE software (DNASTAR). The
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection.
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser
desorption and mass spectrometry may be used for detection of hybridization. The degree of
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on
the microarray may be assessed. In one embodiment, microarray preparation and usage is described
in detail below.

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and
poly(A)+ RNA is purified using the oligo-(dT) cellulose method. Each poly(A)+ RNA sample is
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/µl oligo-(dT) primer (21mer), 1X first
strand buffer, 0.03 units/µl RNase inhibitor, 500 µM dATP, 500 µM dGTP, 500 µM dTTP, 40 µM
dCTP, 40 µM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Biosciences). The reverse transcription
reaction is performed in a 25 ml volume containing 200 ng poly(A)+ RNA with GEMBRIGHT kits
(Incyte Genomics). Specific control poly(A)+ RNAs are synthesized by in vitro transcription from
non-coding yeast genomic DNA. After incubation at 37°C for 2 hr, each reaction sample (one with
Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and incubated for
20 minutes at 85°C to stop the reaction and degrade the RNA. Samples are purified using two
successive CHROMA SPIN 30 gel filtration spin columns (Clontech, Palo Alto CA) and after
combining, both reaction samples are ethanol precipitated using 1 ml of glycogen (1 mg/ml), 60 ml
sodium acetate, and 300 ml of 100% ethanol. The sample is then dried to completion using a
SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended in 14 µl 5X SSC/0.2% SDS.
Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element
is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses
primers complementary to the vector sequences flanking the cDNA insert. Array elements are
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 µg.
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Biosciences).

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and
coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 110°C
oven.

Array elements are applied to the coated glass substrate using a procedure described in U.S.
Patent No. 5,807,522, incorporated herein by reference. 1 µl of the array element DNA, at an average
concentration of 100 ng/µl, is loaded into the open capillary printing element by a high-speed robotic
apparatus. The apparatus then deposits about 5 nl of array element sample per slide.

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene).
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water.
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60°C followed by washes in 0.2%
SDS and distilled water as before.

Hybridization 

Hybridization reactions contain 9 µl of sample mixture consisting of 0.2 µg each of Cy3 and
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered with
a 1.8 cm^2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140
µl of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for about
6.5 hours at 60°C. The arrays are washed for 10 min at 45°C in a first wash buffer (1X SSC, 0.1%
SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X SSC), and dried.
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide
containing the array is placed on a computer-controlled X-Y stage on the microscope and
raster-scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a
resolution of 20 micrometers.
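
As a worked illustration, the size of the raster implied by the stated dimensions may be computed as in the following Python sketch. It assumes one sample point per 20 micrometer step along each axis, which the text does not state explicitly; the variable names are hypothetical.

```python
# Array side length and scan resolution, both in micrometers.
ARRAY_SIDE_UM = 18_000   # 1.8 cm
RESOLUTION_UM = 20

# Assumption: one sample point per 20 micrometer step on each axis.
pixels_per_side = ARRAY_SIDE_UM // RESOLUTION_UM
total_pixels = pixels_per_side ** 2
```

Under this assumption, each scan of the array yields a 900 by 900 image, or 810,000 intensity values per fluorophore.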

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially.
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477,
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source,
although the apparatus is capable of recording the spectra from both fluorophores simultaneously.
The sensitivity of the scans is typically calibrated using the signal intensity generated by a
cDNA control species added to the sample mixture at a known concentration. A specific location on
the array contains a complementary DNA sequence, allowing the intensity of the signal at that location
to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples from
different sources (e.g., representing test and control cells), each labeled with a different fluorophore,
are hybridized to a single array for the purpose of identifying genes that are differentially expressed,
the calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and
adding identical amounts of each to the hybridization mixture.

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC
computer. The digitized data are displayed as an image where the signal intensity is mapped using a
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high
signal). The data are also analyzed quantitatively. Where two different fluorophores are excited and
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission
spectra) between the fluorophores using each fluorophore's emission spectrum.
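
One possible reading of the linear 20-color transformation is sketched below in Python. The function name is hypothetical, and the exact mapping used by the display software is not given in the text; the sketch simply divides the 12-bit range (0 to 4095) into 20 equal-width bins, with bin 0 standing for the blue (low signal) end and bin 19 for the red (high signal) end.

```python
ADC_LEVELS = 4096   # a 12-bit converter yields values 0 through 4095
N_COLORS = 20

def pseudocolor_bin(adc_value):
    """Map a 12-bit intensity linearly onto a color index from 0 to 19."""
    if not 0 <= adc_value < ADC_LEVELS:
        raise ValueError("value outside the 12-bit range")
    return adc_value * N_COLORS // ADC_LEVELS

lowest_bin = pseudocolor_bin(0)
middle_bin = pseudocolor_bin(2048)
highest_bin = pseudocolor_bin(4095)
```

Integer arithmetic is used so that the top value, 4095, falls in the last bin rather than overflowing to an index of 20.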

A grid is superimposed over the fluorescence signal image such that the signal from each spot
is centered in each element of the grid. The fluorescence signal within each element is then integrated
to obtain a numerical value corresponding to the average intensity of the signal. The software used
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte Genomics). Array
elements that exhibit at least about a two-fold change in expression, a signal-to-background ratio of at
least about 2.5, and an element spot size of at least about 40%, are considered to be differentially
expressed.
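
The three criteria above may be combined as in the following Python sketch, which is offered as an illustration and not as the logic of the analysis program named in the text. The function name, argument names, and example values are hypothetical. The fold change is taken as the larger of the two intensity ratios, so that increases and decreases are treated symmetrically, and the "about" thresholds are treated as hard cutoffs.

```python
def is_differentially_expressed(cy3_signal, cy5_signal,
                                signal_to_background, spot_size_fraction,
                                min_fold=2.0, min_s2b=2.5, min_spot=0.40):
    """Apply the fold-change, signal-to-background, and spot-size criteria."""
    if cy3_signal <= 0 or cy5_signal <= 0:
        return False  # a ratio cannot be formed without two positive signals
    ratio = cy5_signal / cy3_signal
    fold_change = max(ratio, 1.0 / ratio)  # symmetric for up and down changes
    return (fold_change >= min_fold
            and signal_to_background >= min_s2b
            and spot_size_fraction >= min_spot)

# Hypothetical array elements (illustration only).
increased = is_differentially_expressed(1000, 2500, 3.0, 0.50)
decreased = is_differentially_expressed(2500, 1000, 3.0, 0.50)
small_change = is_differentially_expressed(1000, 1500, 3.0, 0.50)
weak_signal = is_differentially_expressed(1000, 2500, 2.0, 0.50)
```

Only the first two hypothetical elements satisfy all three criteria; the third fails the fold-change test and the fourth fails the signal-to-background test.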
Expression 

For example, both SEQ ID NO:62 and SEQ ID NO:71 showed differential expression
in cancer cell lines or tumorous tissue versus non-cancerous cell lines or tissues, as determined by
microarray analysis. The expression of SEQ ID NO:62 was decreased by at least two-fold in ovarian
tumor tissue from a 79-year-old female donor as compared to matched normal ovarian tissue from the
same donor. Matched normal and tumorigenic ovarian tissue samples were provided by the Huntsman
Cancer Institute (Salt Lake City, UT). Therefore, SEQ ID NO:62 is useful in diagnostic assays for
ovarian cancer. The expression of SEQ ID NO:71 was increased by at least two-fold in lung
squamous cell carcinoma tissue as compared to matched normal lung tissue from the same donor.
Matched normal and tumorigenic lung tissue samples were provided by the Roy Castle International
Centre for Lung Cancer Research (Liverpool, UK). Therefore, SEQ ID NO:71 is useful in diagnostic
assays for lung cancer.

In addition, the expression of SEQ ID NO:71 was increased at least two-fold in Tangier
disease-derived fibroblasts compared to normal fibroblasts. Both types of cells were also cultured in
the presence of cholesterol and compared with the same cell type cultured in the absence of
cholesterol. Human fibroblasts were obtained from skin explants from both normal subjects and two
patients with homozygous Tangier disease. Cell lines were immortalized by transfection with human
papillomavirus 16 genes E6 and E7 and a neomycin resistance selectable marker. TD derived cells
are deficient in an assay of apoA-I mediated tritiated cholesterol efflux. Therefore, SEQ ID NO:71 is
also useful in diagnostic assays for Tangier disease.

In an alternative example, SEQ ID NO:79 and SEQ ID NO:81 were differentially expressed
in human breast tumor cells. Histological and molecular evaluation of breast tumors reveals that the
development of breast cancer evolves through a multi-step process whereby pre-malignant mammary
epithelial cells undergo a relatively defined sequence of events leading to tumor formation. An early
event in tumor development is ductal hyperplasia. Cells undergoing rapid neoplastic growth gradually
progress to invasive carcinoma and become metastatic to the lung, bone, and potentially other organs.
Several variables that may influence the process of tumor progression and malignant transformation
include genetic factors, environmental factors, growth factors, and hormones. Based on the
complexity of this process, it is critical to study a population of human mammary epithelial cells
undergoing the process of malignant transformation, and to associate specific stages of progression
with phenotypic and molecular characteristics. Primary breast epithelial cells (HMECs) were
compared to various breast carcinoma lines at various stages of tumor progression.

Gene expression profiles of nonmalignant mammary epithelial cells were compared to gene
expression profiles of various breast carcinoma lines at different stages of tumor progression. The
cells were grown in defined serum-free H14 medium to 70-80% confluence prior to RNA harvest.
Cell lines compared included: a) HMEC, a primary breast epithelial cell line isolated from a normal
donor, b) MCF-10A, a breast mammary gland cell line isolated from a 36-year-old woman with
fibrocystic breast disease, c) MCF7, a nonmalignant breast adenocarcinoma cell line isolated from the
pleural effusion of a 69-year-old female, d) T-47D, a breast carcinoma cell line isolated from a pleural
effusion obtained from a 54-year-old female with an infiltrating ductal carcinoma of the breast, e)
Sk-BR-3, a breast adenocarcinoma cell line isolated from a malignant pleural effusion of a 43-year-old
female, f) BT-20, a breast carcinoma cell line derived in vitro from cells emigrating out of thin slices of
the tumor mass isolated from a 74-year-old female, g) MDA-mb-231, a breast tumor cell line isolated
from the pleural effusion of a 51-year-old female, and h) MDA-mb-435S, a spindle-shaped strain that
evolved from the parent line (435) isolated by R. Cailleau from pleural effusion of a 31-year-old
female with metastatic, ductal adenocarcinoma of the breast.

In various microarray experiments, both SEQ ID NO:79 and SEQ ID NO:81 were
underexpressed by at least two-fold in the following tumor cell lines: MCF7 (nonmalignant breast
adenocarcinoma), T47D (breast carcinoma), BT-20 (breast carcinoma), MDA-mb-231 (breast tumor
cell line which also expresses the Wnt3 oncogene, EGF, and TGF-alpha), and MDA-mb-435S
(spindle-shaped strain of breast tumor cell evolved from a metastatic, ductal adenocarcinoma cell of
the breast).

These experiments indicate that SEQ ID NO:79 and SEQ ID NO:81 exhibited significant
differential expression patterns using microarray techniques, and further establish their utility as
diagnostic markers, for disease staging, or as therapeutic agents which may be useful in a variety of
conditions and diseases involving nucleic acid-associated proteins, including breast cancer.

In an alternative example, SEQ ID NO:90, SEQ ID NO:91, and SEQ ID NO:93 showed
differential expression in tumorous or diseased tissue versus non-tumorous or healthy tissues, as
determined by microarray analysis. Array elements that exhibited at least about a two-fold change in
expression and a signal intensity over 250 units, a signal-to-background ratio of at least 2.5, and an
element spot size of at least 40% were identified as differentially expressed using the GEMTOOLS
program (Incyte Genomics).

For example, SEQ ID NO:90 showed increased expression in lung carcinoma tissue versus
normal lung tissue as determined by microarray analysis. In one experiment, grossly uninvolved lung
tissue from a 66-year-old male was compared to lung squamous cell carcinoma tissue from the same
donor (Roy Castle International Centre for Lung Cancer Research, Liverpool, UK). In a second
experiment, grossly uninvolved lung tissue from a 66-year-old female was compared to lung
adenocarcinoma tissue from the same donor (Roy Castle International Centre for Lung Cancer
Research, Liverpool, UK). Therefore, SEQ ID NO:90 is useful in monitoring treatment of, and
diagnostic assays for, lung cancer and other cell proliferative disorders.

In another example, SEQ ID NO:90 showed differential expression in amygdala entorhinal
cortex tissue versus pooled brain tissue as determined by microarray analysis. Specific brain regions
from 4 individual male donors (47, 48, 59, and 60 years old) were compared to a pooled brain control.
The pooled brain control was reconstituted from the purified mRNA isolated from the major regions of
the brain from two male brains (the 47-year-old male and the 48-year-old male). Therefore, SEQ ID
NO:90 is useful in monitoring treatment of, and diagnostic assays for, Alzheimer's disease and other
neurological disorders.

In another example, SEQ ID NO:91 showed decreased expression in breast carcinoma cell
lines versus a primary breast epithelial cell line isolated from a normal donor, as determined by
microarray analysis. The gene expression profile of a nonmalignant mammary epithelial cell line was
compared to the gene expression profiles of breast carcinoma lines at different stages of tumor
progression. Cell lines compared included: a) BT-20, a breast carcinoma cell line derived in vitro
from the cells emigrating out of thin slices of tumor mass isolated from a 74-year-old female, b)
BT-474, a breast ductal carcinoma cell line that was isolated from a solid, invasive ductal carcinoma of the
breast obtained from a 60-year-old woman, c) BT-483, a breast ductal carcinoma cell line that was
isolated from a papillary invasive ductal tumor obtained from a 23-year-old normal, menstruating,
parous female with a family history of breast cancer, d) Hs 578T, a breast ductal carcinoma cell line
isolated from a 74-year-old female with breast carcinoma, e) MCF7, a nonmalignant breast
adenocarcinoma cell line isolated from the pleural effusion of a 69-year-old female, f) MCF-10A, a
breast mammary gland (luminal ductal characteristics) cell line isolated from a 36-year-old woman
with fibrocystic breast disease, g) MDA-MB-468, a breast adenocarcinoma cell line isolated from the
pleural effusion of a 51-year-old female with metastatic adenocarcinoma of the breast, and h) HMEC,
a primary breast epithelial cell line isolated from a normal donor. SEQ ID NO:91 showed decreased
expression in lines a) through e) above. Therefore, SEQ ID NO:91 is useful in monitoring treatment
of, and diagnostic assays for, breast cancer and other cell proliferative disorders.

In another example, SEQ ID NO:93 showed decreased expression in breast carcinoma cells
treated with estradiol versus untreated breast carcinoma cells as determined by microarray analysis.
BT-20 is a breast carcinoma cell line derived in vitro from the cells emigrating out of thin slices of the
tumor mass isolated from a 74-year-old female. BT-20 cells were treated with a-estradiol for 4, 8, 14,
24, 36, and 48 hours. These treated cells were compared to untreated BT-20 cells kept in culture for
the same amount of time. Therefore, SEQ ID NO:93 is useful in monitoring treatment of, and
diagnostic assays for, breast cancer and other cell proliferative disorders.

In another example, SEQ ID NO:93 showed increased expression in coronary artery epithelial
cells versus pooled vascular endothelial cells as determined by microarray analysis. Human vascular
and microvascular endothelial cells were isolated from coronary artery, umbilical artery and vein,
uterine microvasculature, iliac artery, dermal microvasculature, pulmonary artery, and aorta. All cell types
in this experiment are used as experimental models for investigating the role of endothelium in human
vascular biology. Therefore, SEQ ID NO:93 is useful in monitoring treatment of, and diagnostic
assays for, cardiovascular and other autoimmune/inflammation disorders.

In an alternative example, SEQ ID NO:101, SEQ ID NO:103, SEQ ID NO:106, SEQ ID
NO:112, and SEQ ID NO:113 showed differential expression in tumorous tissues versus non-tumorous
tissues or in treated versus untreated cell lines, as determined by microarray analysis. Array elements
that exhibited at least about a two-fold change in expression and a signal intensity over 250 units, a
signal-to-background ratio of at least 2.0, and an element spot size of at least 40% were identified as
differentially expressed using the GEMTOOLS program (Incyte Genomics).

For example, the expression of SEQ ID NO:101 was decreased at least two-fold in an ovarian
adenocarcinoma when matched with normal tissue from the same donor. The tumorous ovary tissue
was obtained from ovarian adenocarcinoma from a 79-year-old female. Normal ovary tissue was
obtained from ovary from the same donor. Therefore, SEQ ID NO:101 is useful in diagnostic assays
for ovarian adenocarcinoma. Matched normal and tumorigenic ovary tissue samples were provided by
the Huntsman Cancer Institute (Salt Lake City, UT).

The gene expression profile of a nonmalignant mammary epithelial cell line (MCF-10A) or a
nonmalignant mammary epithelial cell line (HMEC) was compared to the gene expression profiles of
breast carcinoma lines at different stages of tumor progression. Cell lines compared included: a)
HMEC, a primary breast epithelial cell line isolated from a normal donor; b) MCF-10A, a breast
mammary gland cell line isolated from a 36-year-old woman with fibrocystic breast disease; c) MCF7,
a nonmalignant breast adenocarcinoma cell line isolated from the pleural effusion of a 69-year-old
female; d) T-47D, a breast carcinoma cell line isolated from a pleural effusion obtained from a
54-year-old female with an infiltrating ductal carcinoma of the breast; e) Sk-BR-3, a breast
adenocarcinoma cell line isolated from a malignant pleural effusion of a 43-year-old female; f) BT-20,
a breast carcinoma cell line derived in vitro from tumor mass isolated from a 74-year-old female;
g) MDA-mb-231, a breast tumor cell line isolated from the pleural effusion of a 51-year-old female;
and h) MDA-mb-435S, a spindle-shaped strain that evolved from the parent line (435) isolated from
the pleural effusion of a 31-year-old female with metastatic, ductal adenocarcinoma of the breast. All
cell lines were grown in the supplier's recommended medium to 70-80% confluence prior to
comparison. The expression of SEQ ID NO:101 was decreased at least two-fold in three (MCF7,
T-47D, and MDA-mb-231) of the seven cell lines when compared with HMEC or MCF-10A cell lines.
These experiments indicate that SEQ ID NO:101 was significantly under-expressed in the breast
tumor cell lines tested, establishing the utility of SEQ ID NO:101 as a diagnostic marker for disease
staging or as a potential therapeutic target for breast cancer.

In another example, SEQ ID NO:103 exhibited differential expression by microarray analysis.
Early confluent C3A cells were treated with mifepristone, progesterone, beclomethasone,
medroxyprogesterone, budesonide, prednisone, dexamethasone, betamethasone, or danazol at
concentrations of 1 µM, 10 µM, and 100 µM for 1, 3, and 6 hours. In all cases mRNA from untreated
early confluent C3A cells was prepared in parallel as described below. The expression of SEQ ID
NO:103 was decreased at least two-fold in HCA3 cells when treated with Progesterone,
Beclomethasone, Medroxyprogesterone, Budesonide, Prednisone, Dexamethasone, and
Betamethasone at doses between 1 µM and 100 µM for between 1 and 6 hours, and with Catechol plus
water at 100 µM for 1, 3, and 6 hours. Therefore, SEQ ID NO:103 is useful as a diagnostic marker for
disease staging or as a potential therapeutic target for liver disorders such as adenomatosis,
cholestasis, cirrhosis, hemangioma, Henoch-Schonlein purpura, hepatitis, hepatocellular and metastatic
carcinomas, idiopathic thrombocytopenic purpura, porphyria, sarcoidosis, and Wilson disease.

In another example, the expression of SEQ ID NO:106 was increased at least four-fold in
lung squamous cell carcinoma when matched with normal tissue from the same donor. The tumorous
tissue was obtained from lung squamous cell carcinoma from a 68-year-old female. Normal lung
tissue was obtained from the same donor. Therefore, SEQ ID NO:106 is useful in diagnostic assays
for lung squamous cell carcinoma, in disease staging, and as a potential therapeutic target. Matched
normal and tumorigenic lung tissue samples were provided by the Roy Castle International Centre for
Lung Cancer Research (Liverpool, UK). Further, the expression of SEQ ID NO:106 was decreased
at least three-fold in colon tumor when matched with normal tissue from the same donor. Tumorous
colon tissue was obtained from a 67-year-old donor with moderately differentiated colon
adenocarcinoma. Normal tissue was obtained from the colon of the same donor. Therefore, SEQ ID
NO:106 is useful as a diagnostic marker, for disease staging, or as a potential therapeutic target for
colon adenocarcinoma. Matched normal and tumorigenic colon tissue samples were provided by the
Huntsman Cancer Institute (Salt Lake City, UT).

In another example, the expression of SEQ ID NO:112 was decreased at least two-fold in
two colon adenocarcinomas when matched with normal tissue from the same donor. The tumorous
tissue was obtained from a 64-year-old female with moderately differentiated colon adenocarcinoma
and from an 83-year-old female with colon adenocarcinoma. Normal tissues were obtained from the
colons of the same donors. Therefore, SEQ ID NO:112 is useful as a diagnostic marker, for disease
staging, or as a potential therapeutic target for colon adenocarcinoma. Matched normal and
tumorigenic colon tissue samples are provided by the Huntsman Cancer Institute (Salt Lake City, UT).

In yet another example, SEQ ID NO:113 showed differential expression in inflammatory
responses as determined by microarray analysis. The expression of SEQ ID NO:113 was increased
by at least two-fold in three cell lines, Daudi (a B lymphoblast cell line [Burkitt's lymphoma]), Jurkat
(an acute T cell leukemia cell line that grows in the absence of external stimuli), and THP-1 (a
promonocyte cell line isolated from the peripheral blood of a 1-year-old male with acute monocytic
leukemia). These three cell lines are used extensively for the study of signaling in human T cells and
B cells. These cell lines were treated with various concentrations of PMA (a broad activator of
protein kinase C-dependent (PKC) pathways), with or without Ionomycin (a calcium ionophore that
permits the entry of calcium into the cell), with or without Brefeldin A (an antiviral antibiotic factor
produced by Penicillium brefeldianum), with or without antibodies to CD3 or CD8, or with or without
LPS (lipopolysaccharide). The expression of SEQ ID NO:113 was increased by at least two-fold in
three tumorous cell lines from breast (MDA-mb-231, T-47D, and MCF7) when treated with 1 or 10
ng/ml of TNFα. See descriptions supra. The expression of SEQ ID NO:113 was also increased by at
least 2.9-fold in DU 145, a prostate carcinoma cell line isolated from a metastatic site in the brain of a
69-year-old male with widespread metastatic prostate carcinoma, when compared to PrEC, a primary
prostate epithelial cell line isolated from a normal donor; and increased by at least two-fold in the
following endothelial cell lines: ECV304 (a cell line derived from the endothelium of the human
umbilical vein) treated with PMA and Ionomycin or TNFα; HIAEC (primary cells derived from the
endothelium of an iliac artery); HMVEC (primary cells derived from the microvascular endothelium
of human skin); HUAEC (primary cells derived from the endothelium of an umbilical artery); and
UtMVECMyo (primary cells derived from the uterine myometrium microvasculature), all treated with
10 ng/ml TNFα when compared to untreated cells of the same cell line; and HUVEC (primary cells
derived from the endothelium of the human umbilical vein) treated with TNFα and cycloheximide
when compared to HUVEC cells treated with cycloheximide alone.

Further, the expression of SEQ ID NO:113 was decreased by at least two-fold in HepG2 (a
human hepatoma cell line isolated from a 15-year-old male with a liver tumor) when treated with
TNFα. SEQ ID NO:113 was decreased at least two-fold in five out of six breast tumor cell lines
(BT20, HMEC, MDA-mb-231, MDA-mb-435S, T-47D; see descriptions supra) when compared with
the MCF10A cell line. The expression of SEQ ID NO:113 was decreased at least 2.5-fold in a colon
cancer when matched with normal tissue from the same donor. The tumorous colon tissue was
obtained from a 73-year-old female with colon cancer. Normal colon tissue was obtained from the
colon of the same donor. Also, the expression of SEQ ID NO:113 was decreased at least two-fold in
a donor with lung adenocarcinoma and decreased at least 2.5-fold in four donors with lung squamous
cell carcinoma. The adenocarcinoma tissue was obtained from a 66-year-old female with lung
adenocarcinoma. The squamous cell carcinoma tissue was obtained from a 73-year-old male, a
68-year-old female, a 66-year-old male, and a 73-year-old male. Normal lung tissues were obtained from
the lungs of the same donors. All matched normal and tumorous lung tissue samples were provided by
the Roy Castle International Centre for Lung Cancer Research (Liverpool UK). Therefore, SEQ ID
NO:113 is useful in monitoring treatment of, and diagnostic assays for, autoimmune/inflammatory
disorders, breast cancer, colon cancer, and lung cancer.
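
The "at least two-fold" and "at least 2.5-fold" criteria used in the examples above can be illustrated with a minimal sketch. It assumes that each comparison reduces to a pair of positive, background-corrected signal intensities (sample versus matched reference); the function names and the default threshold are hypothetical, and no normalization step from the actual microarray analysis is reproduced here.

```python
def signed_fold_change(sample: float, reference: float) -> float:
    """Return +x for an x-fold increase and -x for an x-fold decrease."""
    if sample <= 0 or reference <= 0:
        raise ValueError("signal intensities must be positive")
    if sample >= reference:
        return sample / reference
    return -(reference / sample)


def call_expression(sample: float, reference: float, threshold: float = 2.0) -> str:
    """Classify a comparison as 'increased', 'decreased', or 'unchanged'."""
    change = signed_fold_change(sample, reference)
    if change >= threshold:
        return "increased"
    if change <= -threshold:
        return "decreased"
    return "unchanged"
```

For instance, a tumor signal of 100 against a matched normal signal of 250 gives a signed fold change of -2.5 and would be reported as decreased at the two-fold threshold.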

XII. Complementary Polynucleotides

Sequences complementary to the NAAP-encoding sequences, or any parts thereof, are used
to detect, decrease, or inhibit expression of naturally occurring NAAP. Although use of
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of NAAP. To
inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' sequence
and used to prevent promoter binding to the coding sequence. To inhibit translation, a complementary
oligonucleotide is designed to prevent ribosomal binding to the NAAP-encoding transcript.
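
As an illustration of the design step, the following sketch derives a complementary oligonucleotide of 15 to 30 bases from a chosen window of a coding sequence. It is not the algorithm used by the OLIGO 4.06 software; the function names, the default window, and the restriction to unambiguous DNA bases are assumptions made for this example, and no uniqueness or melting-temperature screening is attempted.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    seq = seq.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence may contain only A, C, G, and T")
    return seq.translate(_COMPLEMENT)[::-1]


def antisense_oligo(coding_seq: str, start: int = 0, length: int = 20) -> str:
    """Return an oligonucleotide complementary to coding_seq[start:start+length]."""
    if not 15 <= length <= 30:
        raise ValueError("length must be between 15 and 30 bases")
    if start < 0:
        raise ValueError("start must be non-negative")
    target = coding_seq[start:start + length]
    if len(target) != length:
        raise ValueError("window extends past the end of the sequence")
    return reverse_complement(target)
```

Calling `antisense_oligo("ATGGCCAAGCTTGGATCCGA", start=0, length=15)` returns the 15-mer complementary to the first fifteen bases of that hypothetical coding sequence, written 5' to 3'.
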
XIII. Expression of NAAP 

Expression and purification of NAAP is achieved using bacterial or virus-based expression
systems. For expression of NAAP in bacteria, cDNA is subcloned into an appropriate vector
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3).
Antibiotic resistant bacteria express NAAP upon induction with isopropyl
beta-D-thiogalactopyranoside (IPTG). Expression of NAAP in eukaryotic cells is achieved by infecting insect
or mammalian cell lines with recombinant Autographa californica nuclear polyhedrosis virus
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is
replaced with cDNA encoding NAAP by either homologous recombination or bacterial-mediated
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases.
Infection of the latter requires additional genetic modifications to baculovirus (Engelhard, E.K. et al.
(1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther.
7:1937-1945).

In most expression systems, NAAP is synthesized as a fusion protein with, e.g., glutathione
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step,
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton
enzyme from Schistosoma japonicum, enables the purification of fusion proteins on immobilized
glutathione under conditions that maintain protein activity and antigenicity (Amersham Biosciences).
Following purification, the GST moiety can be proteolytically cleaved from NAAP at specifically
engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification using
commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a
stretch of six consecutive histidine residues, enables purification on metal-chelate resins (QIAGEN).
Methods for protein expression and purification are discussed in Ausubel et al. (supra, ch. 10 and 16).
Purified NAAP obtained by these methods can be used directly in the assays shown in Examples
XVII, XVIII, and XIX, where applicable.
XIV. Functional Assays 

NAAP function is assessed by expressing the sequences encoding NAAP at physiologically
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice
include pCMV SPORT plasmid (Invitrogen, Carlsbad CA) and pCR3.1 plasmid (Invitrogen), both of
which contain the cytomegalovirus promoter. 5-10 µg of recombinant vector are transiently
transfected into a human cell line, for example, an endothelial or hematopoietic cell line, using either
liposome formulations or electroporation. 1-2 µg of an additional plasmid containing sequences
encoding a marker protein are co-transfected. Expression of a marker protein provides a means to
distinguish transfected cells from nontransfected cells and is a reliable predictor of cDNA expression
from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein
(GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser
optics-based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to
evaluate the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the
uptake of fluorescent molecules that diagnose events preceding or coincident with cell death. These
events include changes in nuclear DNA content as measured by staining of DNA with propidium
iodide; changes in cell size and granularity as measured by forward light scatter and 90 degree side
light scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine
uptake; alterations in expression of cell surface and intracellular proteins as measured by reactivity
with specific antibodies; and alterations in plasma membrane composition as measured by the binding
of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are
discussed in Ormerod, M.G. (1994; Flow Cytometry, Oxford, New York NY).

The influence of NAAP on gene expression can be assessed using highly purified populations
of cells transfected with sequences encoding NAAP and either CD64 or CD64-GFP. CD64 and
CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success
NY). mRNA can be purified from the cells using methods well known by those of skill in the art.
Expression of mRNA encoding NAAP and other genes of interest can be analyzed by northern
analysis or microarray techniques.

XV. Production of NAAP Specific Antibodies

NAAP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g.,
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to
immunize animals (e.g., rabbits, mice, etc.) and to produce antibodies using standard protocols.
Alternatively, the NAAP amino acid sequence is analyzed using LASERGENE software
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are well
described in the art (Ausubel et al., supra, ch. 11).

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH
(Sigma-Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to
increase immunogenicity (Ausubel et al., supra). Rabbits are immunized with the oligopeptide-KLH
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-NAAP
activity by, for example, binding the peptide or NAAP to a substrate, blocking with 1% BSA, reacting
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG.
XVI. Purification of Naturally Occurring NAAP Using Specific Antibodies 

Naturally occurring or recombinant NAAP is substantially purified by immunoaffinity
chromatography using antibodies specific for NAAP. An immunoaffinity column is constructed by
covalently coupling anti-NAAP antibody to an activated chromatographic resin, such as
CNBr-activated SEPHAROSE (Amersham Biosciences). After the coupling, the resin is blocked and
washed according to the manufacturer's instructions.

Media containing NAAP are passed over the immunoaffinity column, and the column is
washed under conditions that allow the preferential adsorption of NAAP (e.g., high ionic strength
buffers in the presence of detergent). The column is eluted under conditions that disrupt
antibody/NAAP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such
as urea or thiocyanate ion), and NAAP is collected.

XVII. Identification of Molecules Which Interact with NAAP

NAAP, or biologically active fragments thereof, are labeled with 125I Bolton-Hunter reagent
(Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539). Candidate molecules previously
arrayed in the wells of a multi-well plate are incubated with the labeled NAAP, washed, and any wells
with labeled NAAP complex are assayed. Data obtained using different concentrations of NAAP are
used to calculate values for the number, affinity, and association of NAAP with the candidate
molecules.
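
One conventional way to obtain the number of binding sites and the affinity from such concentration-dependent data is a Scatchard analysis, sketched below. The source does not state which fitting method is used, so this is an assumption: the example presumes a single class of non-interacting sites, bound and free labeled NAAP expressed in the same concentration units, and a plain least-squares line through bound/free versus bound. The function name is hypothetical.

```python
def scatchard_fit(bound, free):
    """Fit bound/free against bound; return (Kd, Bmax).

    For one class of sites, bound/free = (Bmax - bound) / Kd, so the
    slope is -1/Kd and the x-intercept is Bmax (number of sites).
    """
    if len(bound) != len(free) or len(bound) < 2:
        raise ValueError("need at least two matched (bound, free) points")
    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_x = sum(bound) / n
    mean_y = sum(ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in bound)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound, ratios))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope           # dissociation constant
    bmax = -intercept / slope   # x-intercept of the Scatchard line
    return kd, bmax
```

The association constant follows as 1/Kd. Real binding data are noisy and Scatchard transforms distort the error structure, so a direct nonlinear fit of the binding isotherm may be preferable in practice.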

Alternatively, molecules interacting with NAAP are analyzed using the yeast two-hybrid
system as described in Fields, S. and O. Song (1989; Nature 340:245-246), or using commercially
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech).

NAAP may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT)
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S.
Patent No. 6,057,101).
XVIII. Demonstration of NAAP Activity

NAAP activity is measured by its ability to stimulate transcription of a reporter gene (Liu,
H.Y. et al. (1997) EMBO J. 16:5289-5298). The assay entails the use of a well characterized
reporter gene construct, LexAop-LacZ, that consists of LexA DNA transcriptional control elements
(LexAop) fused to sequences encoding the E. coli LacZ enzyme. The methods for constructing and
expressing fusion genes, introducing them into cells, and measuring LacZ enzyme activity, are well
known to those skilled in the art. Sequences encoding NAAP are cloned into a plasmid that directs
the synthesis of a fusion protein, LexA-NAAP, consisting of NAAP and a DNA binding domain
derived from the LexA transcription factor. The resulting plasmid, encoding a LexA-NAAP fusion




WO 03/054219 



PCT/US02/41115



protein, is introduced into yeast cells along with a plasmid containing the LexAop-LacZ reporter gene.
The amount of LacZ enzyme activity associated with LexA-NAAP transfected cells, relative to
control cells, is proportional to the amount of transcription stimulated by the NAAP.

Alternatively, NAAP activity is measured by its ability to bind zinc. A 5-10 µM sample
solution in 2.5 mM ammonium acetate solution at pH 7.4 is combined with 0.05 M zinc sulfate solution
(Aldrich, Milwaukee WI) in the presence of 100 nM dithiothreitol with 10% methanol added. The
sample and zinc sulfate solutions are allowed to incubate for 20 minutes. The reaction solution is
passed through a VYDAC column (Grace Vydac, Hesperia, CA) with approximately 300 Angstrom
pore size and 5 µm particle size to isolate zinc-sample complex from the solution, and into a mass
spectrometer (PE Sciex, Ontario, Canada). Zinc bound to sample is quantified using the functional
atomic mass of 63.5 Da observed by Whittal, R. M. et al. ((2000) Biochemistry 39:8406-8417).
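
The quantification step can be sketched as a simple mass-shift calculation: the difference between the measured masses of the zinc-bound and zinc-free sample is divided by the functional atomic mass of 63.5 Da stated above. The function name, the rounding to a whole number of ions, and the tolerance are assumptions introduced for illustration only.

```python
ZINC_FUNCTIONAL_MASS_DA = 63.5  # value quoted in the text (Whittal et al.)


def zinc_ions_bound(apo_mass_da: float, holo_mass_da: float,
                    tolerance: float = 0.25) -> int:
    """Estimate the number of zinc ions bound from the observed mass shift.

    Raises ValueError when the shift is negative or is not within
    `tolerance` (in units of zinc ions) of a whole number.
    """
    shift = holo_mass_da - apo_mass_da
    ions = shift / ZINC_FUNCTIONAL_MASS_DA
    nearest = round(ions)
    if nearest < 0 or abs(ions - nearest) > tolerance:
        raise ValueError("mass shift is not consistent with whole zinc ions")
    return nearest
```

A hypothetical sample measured at 10,000.0 Da without zinc and 10,127.0 Da with zinc would be assigned two bound zinc ions.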

In the alternative, a method to determine nucleic acid binding activity of NAAP involves a
polyacrylamide gel mobility-shift assay. In preparation for this assay, NAAP is expressed by
transforming a mammalian cell line such as COS7, HeLa or CHO with a eukaryotic expression vector
containing NAAP cDNA. The cells are incubated for 48-72 hours after transformation under
conditions appropriate for the cell line to allow expression and accumulation of NAAP. Extracts
containing solubilized proteins can be prepared from cells expressing NAAP by methods well known
in the art. Portions of the extract containing NAAP are added to [32P]-labeled RNA or DNA.
Radioactive nucleic acid can be synthesized in vitro by techniques well known in the art. The
mixtures are incubated at 25 °C in the presence of RNase- and DNase-inhibitors under buffered
conditions for 5-10 minutes. After incubation, the samples are analyzed by polyacrylamide gel
electrophoresis followed by autoradiography. The presence of a band on the autoradiogram indicates
the formation of a complex between NAAP and the radioactive transcript. A band of similar mobility
will not be present in samples prepared using control extracts prepared from untransformed cells.

In the alternative, a method to determine methylase activity of NAAP measures transfer of
radiolabeled methyl groups between a donor substrate and an acceptor substrate. Reaction mixtures
(50 µl final volume) contain 15 mM HEPES, pH 7.9, 1.5 mM MgCl2, 10 mM dithiothreitol, 3%
polyvinylalcohol, 1.5 µCi [methyl-3H]AdoMet (0.375 µM AdoMet) (DuPont-NEN), 0.6 µg NAAP, and
acceptor substrate (e.g., 0.4 µg [35S]RNA, or 6-mercaptopurine (6-MP) to 1 mM final concentration).
Reaction mixtures are incubated at 30°C for 30 minutes, then 65°C for 5 minutes.

Analysis of [methyl-3H]RNA is as follows: (1) 50 µl of 2 x loading buffer (20 mM Tris-HCl,
pH 7.6, 1 M LiCl, 1 mM EDTA, 1% sodium dodecyl sulphate (SDS)) and 50 µl oligo d(T)-cellulose
(10 mg/ml in 1 x loading buffer) are added to the reaction mixture, and incubated at ambient
temperature with shaking for 30 minutes. (2) Reaction mixtures are transferred to a 96-well filtration
plate attached to a vacuum apparatus. (3) Each sample is washed sequentially with three 2.4 ml
aliquots of 1 x oligo d(T) loading buffer containing 0.5% SDS, 0.1% SDS, or no SDS. (4) RNA is
eluted with 300 µl of water into a 96-well collection plate, transferred to scintillation vials containing
liquid scintillant, and radioactivity determined.

Analysis of [methyl-3H]6-MP is as follows: (1) 500 µl 0.5 M borate buffer, pH 10.0, and then
2.5 ml of 20% (v/v) isoamyl alcohol in toluene are added to the reaction mixtures. (2) The samples
are mixed by vigorous vortexing for ten seconds. (3) After centrifugation at 700 x g for 10 minutes, 1.5
ml of the organic phase is transferred to scintillation vials containing 0.5 ml absolute ethanol and liquid
scintillant, and radioactivity determined. (4) Results are corrected for the extraction of 6-MP into the
organic phase (approximately 41%).
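
The correction in step (4) can be worked through as a short calculation. This is a sketch under stated assumptions: counts are expressed as dpm (that is, counting efficiency has already been accounted for), the 1.5 ml aliquot is taken from 2.5 ml of organic phase whose volume is unchanged by the extraction, the 41% figure is read as the fraction of methylated product recovered in the organic phase, and 1 µCi is taken as 2.22 x 10^6 dpm. The function names are hypothetical.

```python
DPM_PER_MICROCURIE = 2.22e6  # disintegrations per minute in 1 µCi


def adomet_dpm_per_pmol(label_uci: float = 1.5, adomet_um: float = 0.375,
                        volume_ul: float = 50.0) -> float:
    """Specific activity of the [methyl-3H]AdoMet in the reaction, in dpm/pmol."""
    pmol_adomet = adomet_um * volume_ul  # µM x µl = pmol
    return label_uci * DPM_PER_MICROCURIE / pmol_adomet


def methylated_6mp_pmol(aliquot_dpm: float, aliquot_ml: float = 1.5,
                        organic_ml: float = 2.5,
                        extraction_efficiency: float = 0.41) -> float:
    """Convert dpm in the counted aliquot to pmol of methyl groups transferred."""
    total_organic_dpm = aliquot_dpm * organic_ml / aliquot_ml
    corrected_dpm = total_organic_dpm / extraction_efficiency
    return corrected_dpm / adomet_dpm_per_pmol()
```

With the reaction composition given above, 0.375 µM AdoMet in 50 µl is 18.75 pmol carrying 1.5 µCi, or about 177,600 dpm per pmol of methyl donor.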

In the alternative, type I topoisomerase activity of NAAP can be assayed based on the
relaxation of a supercoiled DNA substrate. NAAP is incubated with its substrate in a buffer lacking
Mg2+ and ATP, the reaction is terminated, and the products are loaded on an agarose gel. Altered
topoisomers can be distinguished from supercoiled substrate electrophoretically. This assay is specific
for type I topoisomerase activity because Mg2+ and ATP are necessary cofactors for type II
topoisomerases.

Type II topoisomerase activity of NAAP can be assayed based on the decatenation of a
kinetoplast DNA (KDNA) substrate. NAAP is incubated with KDNA, the reaction is terminated,
and the products are loaded on an agarose gel. Monomeric circular KDNA can be distinguished from
catenated KDNA electrophoretically. Kits for measuring type I and type II topoisomerase activities
are available commercially from Topogen (Columbus OH).

ATP-dependent RNA helicase unwinding activity of NAAP can be measured by the method
described by Zhang and Grosse (1994; Biochemistry 33:3906-3912). The substrate for RNA
unwinding consists of 32P-labeled RNA composed of two RNA strands of 194 and 130 nucleotides in
length containing a duplex region of 17 base-pairs. The RNA substrate is incubated together with
ATP, Mg2+, and varying amounts of NAAP in a Tris-HCl buffer, pH 7.5, at 37°C for 30 minutes. The
single-stranded RNA product is then separated from the double-stranded RNA substrate by
electrophoresis through a 10% SDS-polyacrylamide gel, and quantitated by autoradiography. The
amount of single-stranded RNA recovered is proportional to the amount of NAAP in the preparation.
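
The quantitation of the autoradiogram can be illustrated by a fraction-unwound calculation. This sketch assumes that the band intensities for the single-stranded product and the remaining duplex substrate have been measured in the same lane, and that an optional no-enzyme control supplies the background fraction of spontaneously unwound substrate; neither the function name nor the background correction is taken from the cited method.

```python
def fraction_unwound(ss_signal: float, ds_signal: float,
                     background_fraction: float = 0.0) -> float:
    """Fraction of duplex RNA converted to single strands, background-corrected."""
    if ss_signal < 0 or ds_signal < 0:
        raise ValueError("band intensities must be non-negative")
    total = ss_signal + ds_signal
    if total == 0:
        raise ValueError("no signal in lane")
    if not 0.0 <= background_fraction < 1.0:
        raise ValueError("background_fraction must be in [0, 1)")
    raw = ss_signal / total
    corrected = (raw - background_fraction) / (1.0 - background_fraction)
    return max(0.0, corrected)
```

Plotting this fraction against the amount of NAAP added gives the dose dependence described in the paragraph above.
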

In the alternative, NAAP function is assessed by expressing the sequences encoding NAAP
at physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a
mammalian expression vector containing a strong promoter that drives high levels of cDNA
expression. Vectors of choice include pCMV SPORT (Life Technologies) and pCR3.1 (Invitrogen
Corporation, Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 µg of
recombinant vector are transiently transfected into a human cell line, preferably of endothelial or
hematopoietic origin, using either liposome formulations or electroporation. 1-2 µg of an additional
plasmid containing sequences encoding a marker protein are co-transfected.

Expression of a marker protein provides a means to distinguish transfected cells from
nontransfected cells and is a reliable predictor of cDNA expression from the recombinant vector.
Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; CLONTECH), CD64, or a
CD64-GFP fusion protein. Flow cytometry (FCM), an automated laser optics-based technique, is
used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the apoptotic state of
the cells and other cellular properties.

FCM detects and quantifies the uptake of fluorescent molecules that diagnose events
preceding or coincident with cell death. These events include changes in nuclear DNA content as
measured by staining of DNA with propidium iodide; changes in cell size and granularity as measured
by forward light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as
measured by decrease in bromodeoxyuridine uptake; alterations in expression of cell surface and
intracellular proteins as measured by reactivity with specific antibodies; and alterations in plasma
membrane composition as measured by the binding of fluorescein-conjugated Annexin V protein to the
cell surface. Methods in flow cytometry are discussed in Ormerod, M. G. (1994) Flow Cytometry,
Oxford, New York NY.

The influence of NAAP on gene expression can be assessed using highly purified populations
of cells transfected with sequences encoding NAAP and either CD64 or CD64-GFP. CD64 and
CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Inc., Lake
Success NY). mRNA can be purified from the cells using methods well known by those of skill in the
art. Expression of mRNA encoding NAAP and other genes of interest can be analyzed by northern
analysis or microarray techniques.

Pseudouridine synthase activity of NAAP is assayed using a tritium (3H) release assay
modified from Nurse et al. ((1995) RNA 1:102-112), which measures the release of 3H from the C5
position of the pyrimidine component of uridylate (U) when 3H-radiolabeled U in RNA is isomerized to
pseudouridine (Ψ). A typical 500 µl assay mixture contains 50 mM HEPES buffer (pH 7.5), 100 mM
ammonium acetate, 5 mM dithiothreitol, 1 mM EDTA, 30 units RNase inhibitor, and 0.1-4.2 µM
[5-3H]tRNA (approximately 1 µCi/nmol tRNA). The reaction is initiated by the addition of <5 µl of a
concentrated solution of NAAP (or sample containing NAAP) and incubated for 5 min at 37 °C.
Portions of the reaction mixture are removed at various times (up to 30 min) following the addition of
NAAP and quenched by dilution into 1 ml 0.1 M HCl containing Norit-SA3 (12% w/v). The
quenched reaction mixtures are centrifuged for 5 min at maximum speed in a microcentrifuge, and the
supernatants are filtered through a plug of glass wool. The pellet is washed twice by resuspension in 1
ml 0.1 M HCl, followed by centrifugation. The supernatants from the washes are separately passed
through the glass wool plug and combined with the original filtrate. A portion of the combined filtrate
is mixed with scintillation fluid (up to 10 ml) and counted using a scintillation counter. The amount of
3H released from the RNA and present in the soluble filtrate is proportional to the amount of
pseudouridine synthase activity in the sample (Ramamurthy, V. (1999) J. Biol. Chem.
274:22225-22230).
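
Because portions are removed at several times, the released counts can be reduced to a rate. The sketch below converts dpm to pmol using the stated specific activity of approximately 1 µCi/nmol and fits a least-squares line through the time course. It assumes that counts are already expressed as dpm for the whole reaction, that the release is linear over the sampled interval, and that the specific activity quoted per nmol of tRNA can be applied directly to the released tritium (which holds only if one labeled position per tRNA is released); these are illustrative assumptions, and the function names are hypothetical.

```python
DPM_PER_MICROCURIE = 2.22e6  # disintegrations per minute in 1 µCi


def dpm_to_pmol(dpm: float, specific_activity_uci_per_nmol: float = 1.0) -> float:
    """Convert released 3H (dpm) to pmol using the substrate specific activity."""
    dpm_per_pmol = specific_activity_uci_per_nmol * DPM_PER_MICROCURIE / 1000.0
    return dpm / dpm_per_pmol


def release_rate_pmol_per_min(times_min, released_dpm,
                              specific_activity_uci_per_nmol: float = 1.0) -> float:
    """Least-squares slope of pmol 3H released versus time (pmol/min)."""
    if len(times_min) != len(released_dpm) or len(times_min) < 2:
        raise ValueError("need at least two matched time points")
    pmol = [dpm_to_pmol(d, specific_activity_uci_per_nmol) for d in released_dpm]
    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_p = sum(pmol) / n
    stt = sum((t - mean_t) ** 2 for t in times_min)
    stp = sum((t - mean_t) * (p - mean_p) for t, p in zip(times_min, pmol))
    return stp / stt
```

At 1 µCi/nmol, 2,220 dpm corresponds to 1 pmol, so a time course rising by 2,220 dpm per minute corresponds to a release rate of 1 pmol/min.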

In the alternative, pseudouridine synthase activity of NAAP is assayed at 30 °C to 37 °C in a
mixture containing 100 mM Tris-HCl (pH 8.0), 100 mM ammonium acetate, 5 mM MgCl2, 2 mM
dithiothreitol, 0.1 mM EDTA, and 1-2 fmol of [32P]-radiolabeled runoff transcripts (generated in vitro
by an appropriate RNA polymerase, i.e., T7 or SP6) as substrates. NAAP is added to initiate the
reaction or omitted from the reaction in control samples. Following incubation, the RNA is extracted
with phenol-chloroform, precipitated in ethanol, and hydrolyzed completely to 3'-nucleotide
monophosphates using RNase T2. The hydrolysates are analyzed by two-dimensional thin layer
chromatography, and the amounts of 32P radiolabel present in the ΨMP and UMP spots are evaluated
after exposing the thin layer chromatography plates to film or a PhosphorImager screen. Taking into
account the relative number of uridylate residues in the substrate RNA, the relative amounts of ΨMP and
UMP are determined and used to calculate the relative amount of Ψ per tRNA molecule (expressed in
mol Ψ/mol of tRNA or mol Ψ/mol of tRNA/minute), which corresponds to the amount of
pseudouridine synthase activity in the NAAP sample (Lecointe, F. et al. (1998) J. Biol. Chem.
273:1316-1323).
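
The final calculation can be sketched as follows. The example assumes that the ΨMP and UMP spot intensities are background-corrected and that the number of uridylate residues contributing label to those spots is known for the substrate transcript; that number depends on which nucleotide carries the 32P label and must be supplied by the user. The function name and parameter names are hypothetical.

```python
def psi_per_trna(psi_mp_signal: float, ump_signal: float,
                 labeled_uridines: int) -> float:
    """Mol of pseudouridine formed per mol of tRNA from TLC spot intensities."""
    if psi_mp_signal < 0 or ump_signal < 0:
        raise ValueError("spot intensities must be non-negative")
    total = psi_mp_signal + ump_signal
    if total == 0:
        raise ValueError("no signal in the PsiMP or UMP spots")
    if labeled_uridines < 1:
        raise ValueError("labeled_uridines must be at least 1")
    return psi_mp_signal / total * labeled_uridines
```

Dividing the result by the incubation time in minutes gives the value in mol Ψ/mol of tRNA/minute.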

N2,N2-dimethylguanosine transferase ((m2,2G)methyltransferase) activity of NAAP is
measured in a 160 µl reaction mixture containing 100 mM Tris-HCl (pH 7.5), 0.1 mM EDTA, 10 mM
MgCl2, 20 mM NH4Cl, 1 mM dithiothreitol, 6.2 µM S-adenosyl-L-[methyl-3H]methionine (30-70
Ci/mmol), 8 µg m2,2G-deficient tRNA or wild type tRNA from yeast, and approximately 100 µg of
purified NAAP or a sample comprising NAAP. The reactions are incubated at 30 °C for 90 min and
chilled on ice. A portion of each reaction is diluted to 1 ml in water containing 100 µg BSA. 1 ml of 2
M HCl is added to each sample and the acid insoluble products are allowed to precipitate on ice for 20
min before being collected by filtration through glass fiber filters. The collected material is washed
several times with HCl and quantitated using a liquid scintillation counter. The amount of 3H
incorporated into the m2,2G-deficient, acid-insoluble tRNAs is proportional to the amount of
N2,N2-dimethylguanosine transferase activity in the NAAP sample. Reactions comprising no
substrate tRNAs, or wild-type tRNAs that have already been modified, serve as control reactions
which should not yield acid-insoluble 3H-labeled products.

Polyadenylation activity of NAAP is measured using an in vitro polyadenylation reaction.
The reaction mixture is assembled on ice and comprises 10 µl of 5 mM dithiothreitol, 0.025% (v/v)
NONIDET P-40, 50 mM creatine phosphate, 6.5% (w/v) polyvinyl alcohol, 0.5 unit/µl RNAGUARD
(Pharmacia), 0.025 µg/µl creatine kinase, 1.25 mM cordycepin 5'-triphosphate, and 3.75 mM MgCl2, in
a total volume of 25 µl. 60 fmol of CstF, 50 fmol of CPSF, 240 fmol of PAP, 4 µl of crude or partially
purified CF II and various amounts of CF I are then added to the reaction mix. The volume
is adjusted to 23.5 µl with a buffer containing 50 mM Tris-HCl, pH 7.9, 10% (v/v) glycerol, and 0.1 mM
Na-EDTA. The final ammonium sulfate concentration should be below 20 mM. The reaction is
initiated (on ice) by the addition of 15 fmol of 32P-labeled pre-mRNA template, along with 2.5 µg of
unlabeled tRNA, in 1.5 µl of water. Reactions are then incubated at 30 °C for 75-90 min and stopped
by the addition of 75 µl (approximately two volumes) of proteinase K mix (0.2 M Tris-HCl, pH 7.9,
300 mM NaCl, 25 mM Na-EDTA, 2% (w/v) SDS; 1 µl of 10 mg/ml proteinase K; 0.25 µl of 20 mg/ml
glycogen; and 23.75 µl of water). Following incubation, the RNA is precipitated with ethanol and
analyzed on a 6% (w/v) polyacrylamide, 8.3 M urea sequencing gel. The dried gel is developed by
autoradiography or using a phosphorimager. Cleavage activity is determined by comparing the amount
of cleavage product to the amount of pre-mRNA template. The omission of any of the polypeptide
components of the reaction and substitution of NAAP is useful for identifying the specific biological
function of NAAP in pre-mRNA polyadenylation (Rüegsegger, U. et al. (1996) J. Biol. Chem.
271:6107-6113; and references within).

tRNA synthetase activity is measured as the aminoacylation of a substrate tRNA in the
presence of [14C]-labeled amino acid. NAAP is incubated with [14C]-labeled amino acid and the
appropriate cognate tRNA (for example, [14C]alanine and tRNA(Ala)) in a buffered solution.
14C-labeled product is separated from free [14C]amino acid by chromatography, and the incorporated 14C
is quantified by scintillation counter. The amount of 14C-labeled product detected is proportional to the
activity of NAAP in this assay.

In the alternative, NAAP activity is measured by incubating a sample containing NAAP in a 
solution containing 1 mM ATP, 5 mM Hepes-KOH (pH 7.0), 2.5 mM KCl, 1.5 mM magnesium 
chloride, and 0.5 mM DTT along with misacylated [14C]-Glu-tRNAGln (e.g., 1 µM) and a similar 
concentration of unlabeled L-glutamine. Following the quenching of the reaction with 3 M sodium 
acetate (pH 5.0), the mixture is extracted with an equal volume of water-saturated phenol, and the 
aqueous and organic phases are separated by centrifugation at 15,000 x g at room temperature for 1 
min. The aqueous phase is removed and precipitated with 3 volumes of ethanol at -70°C for 15 min. 
The precipitated aminoacyl-tRNAs are recovered by centrifugation at 15,000 x g at 4°C for 15 min. 
The pellet is resuspended in 25 mM KOH, deacylated at 65°C for 10 min, neutralized with 0.1 M 
HCl (to final pH 6-7), and dried under vacuum. The dried pellet is resuspended in water and spotted 
onto a cellulose TLC plate. The plate is developed in either isopropanol/formic acid/water or 
ammonia/water/chloroform/methanol. The image is subjected to densitometric analysis and the 
relative amounts of Glu and Gln are calculated based on the Rf values and relative intensities of the 
spots. NAAP activity is calculated based on the amount of Gln resulting from the transformation of 
Glu while acylated as Glu-tRNAGln (adapted from Curnow, A.W. et al. (1997) Proc. Natl. Acad. Sci. 
USA 94:11819-26). 
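
By way of illustration only, the densitometric calculation described above might be carried out as follows. This sketch assumes that the Glu and Gln spots have already been assigned by their Rf values and that their integrated, background-subtracted intensities are available. The function names and the conversion to picomoles are hypothetical conveniences, not part of the cited method.

```python
def gln_fraction(glu_intensity: float, gln_intensity: float) -> float:
    """Fraction of recovered 14C label that migrates as Gln on the TLC plate."""
    total = glu_intensity + gln_intensity
    if glu_intensity < 0 or gln_intensity < 0 or total == 0:
        raise ValueError("intensities must be non-negative and not both zero")
    return gln_intensity / total


def gln_formed_pmol(fraction: float, substrate_pmol: float) -> float:
    """Estimate Gln formed, assuming label recovery is equal for both species."""
    return fraction * substrate_pmol


if __name__ == "__main__":
    # Illustrative spot intensities (arbitrary units) and substrate amount.
    frac = gln_fraction(glu_intensity=300.0, gln_intensity=100.0)
    print(f"Gln fraction: {frac:.2f}")
    print(f"Gln formed: {gln_formed_pmol(frac, substrate_pmol=40.0):.1f} pmol")
```

A control reaction lacking NAAP would ordinarily be processed in the same way so that any Gln signal not attributable to NAAP can be subtracted.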

XIX. Identification of NAAP Agonists and Antagonists 

Agonists or antagonists of NAAP activation or inhibition may be tested using the assays 
described in section XVIII. Agonists cause an increase in NAAP activity and antagonists cause a 
decrease in NAAP activity. 
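
As a non-limiting sketch of how the outcome of such a test might be scored, the following compares NAAP activity measured with and without a test compound. The 20% cutoff for a meaningful change and the function name are assumptions chosen for illustration. An appropriate cutoff would depend on the variability of the particular assay.

```python
def classify_compound(activity_with: float, activity_without: float,
                      threshold: float = 0.2) -> str:
    """Label a compound by the relative change it causes in NAAP activity.

    threshold is a hypothetical minimum fractional change (0.2 = 20%)
    required before a compound is called an agonist or antagonist.
    """
    if activity_without <= 0:
        raise ValueError("control activity must be positive")
    change = (activity_with - activity_without) / activity_without
    if change >= threshold:
        return "agonist"
    if change <= -threshold:
        return "antagonist"
    return "no effect"


if __name__ == "__main__":
    # Illustrative activity values in arbitrary units.
    for treated in (150.0, 60.0, 105.0):
        print(treated, classify_compound(treated, activity_without=100.0))
```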



Various modifications and variations of the described compositions, methods, and systems of 
the invention will be apparent to those skilled in the art without departing from the scope and spirit of 
the invention. It will be appreciated that the invention provides novel and useful proteins, and their 
encoding polynucleotides, which can be used in the drug discovery process, as well as methods for 
using these compositions for the detection, diagnosis, and treatment of diseases and conditions. 

Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Nor should the description of such embodiments be considered exhaustive or limit the invention to the 
precise forms disclosed. Furthermore, elements from one embodiment can be readily recombined with 
elements from one or more other embodiments. Such combinations can form a number of 
embodiments within the scope of the invention. It is intended that the scope of the invention be 
defined by the following claims and their equivalents. 

Program     Description                                              Parameter Threshold

TMHMMER     A program that uses a hidden Markov model (HMM) to
            delineate transmembrane segments on protein sequences
            and determine orientation.

Motifs      A program that searches amino acid sequences for
            patterns that match those defined in Prosite.

What is claimed is: 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO:1-57, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:1-2, SEQ ID NO:13, SEQ ID NO:16, SEQ ID NO:18-20, SEQ ID NO:22-23, 
SEQ ID NO:31-32, SEQ ID NO:36-39, SEQ ID NO:41, SEQ ID NO:43, SEQ ID 
NO:46, SEQ ID NO:51, and SEQ ID NO:55, 

c) a polypeptide comprising a naturally occurring amino acid sequence at least 91% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:21 and SEQ ID NO:27, 

d) a polypeptide comprising a naturally occurring amino acid sequence at least 94% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:17, SEQ ID NO:20, and SEQ ID NO:42, 

e) a polypeptide comprising a naturally occurring amino acid sequence at least 95% 
identical to the amino acid sequence of SEQ ID NO:50, 

f) a polypeptide comprising a naturally occurring amino acid sequence at least 96% 
identical to the amino acid sequence of SEQ ID NO:19, 

g) a polypeptide comprising a naturally occurring amino acid sequence at least 98% 
identical to the amino acid sequence of SEQ ID NO:24, 

h) a polypeptide comprising a naturally occurring amino acid sequence at least 99% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:9, SEQ ID NO:33, SEQ ID NO:35, SEQ ID NO:52, and SEQ ID NO:57, 

i) a polypeptide consisting essentially of a naturally occurring amino acid sequence at 
least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO:3-8, SEQ ID NO:10-12, SEQ ID NO:14-15, SEQ ID NO:24-27, SEQ ID 
NO:29, SEQ ID NO:34, SEQ ID NO:40, SEQ ID NO:42, SEQ ID NO:44-45, SEQ ID 
NO:47-49, SEQ ID NO:53-54, and SEQ ID NO:56, 

j) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-57, and 

k) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO:1-57. 

2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO:1-57. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO:58-114. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method of producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said recombinant 
polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide of claim 1, and 

b) recovering the polypeptide so expressed. 

10. A method of claim 9, wherein the polypeptide comprises an amino acid sequence selected 
from the group consisting of SEQ ID NO:1-57. 

11. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:58-114, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:58-67, SEQ ID NO:69-86, SEQ ID NO:88, SEQ ID NO:93, SEQ ID NO:95-96, 
SEQ ID NO:98, SEQ ID NO:100, SEQ ID NO:108, and SEQ ID NO:110-112, 

c) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
91% identical to the polynucleotide sequence of SEQ ID NO:94, 

d) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
93% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:106 and SEQ ID NO:114, 

e) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
94% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:87 and SEQ ID NO:103, 

f) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
96% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:89-90, SEQ ID NO:92, and SEQ ID NO:101, 

g) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
97% identical to the polynucleotide sequence of SEQ ID NO:99, 

h) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
98% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:68, SEQ ID NO:91, and SEQ ID NO:109, 

i) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
99% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:102 and SEQ ID NO:107, 

j) a polynucleotide consisting essentially of a naturally occurring polynucleotide 
sequence at least 90% identical to a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:97, SEQ ID NO:104-105, and SEQ ID NO:113, 

k) a polynucleotide complementary to a polynucleotide of a), 

l) a polynucleotide complementary to a polynucleotide of b), 

m) a polynucleotide complementary to a polynucleotide of c), 

n) a polynucleotide complementary to a polynucleotide of d), 

o) a polynucleotide complementary to a polynucleotide of e), 

p) a polynucleotide complementary to a polynucleotide of f), 

q) a polynucleotide complementary to a polynucleotide of g), 

r) a polynucleotide complementary to a polynucleotide of h), 

s) a polynucleotide complementary to a polynucleotide of i), 

t) a polynucleotide complementary to a polynucleotide of j), and 

u) an RNA equivalent of a)-t). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

14. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, under conditions 
whereby a hybridization complex is formed between said probe and said target 
polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 

16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient. 

18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-57. 

19. A method for treating a disease or condition associated with decreased expression of 
functional NAAP, comprising administering to a patient in need of such treatment the composition of 
claim 17. 

20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

21. A composition comprising an agonist compound identified by a method of claim 20 and a 
pharmaceutically acceptable excipient. 

22. A method for treating a disease or condition associated with decreased expression of 
functional NAAP, comprising administering to a patient in need of such treatment a composition of 
claim 21. 

23. A method of screening a compound for effectiveness as an antagonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

24. A composition comprising an antagonist compound identified by a method of claim 23 and 
a pharmaceutically acceptable excipient. 

25. A method for treating a disease or condition associated with overexpression of functional 
NAAP, comprising administering to a patient in need of such treatment a composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide of claim 1. 

27. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test 
compound, and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test 
compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 
presence of the test compound is indicative of a compound that modulates the activity 
of the polypeptide of claim 1. 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying 
amounts of the compound and in the absence of the compound. 

29. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 
whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide comprising a 
polynucleotide sequence of a polynucleotide of claim 12 or fragment thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample with 
the amount of hybridization complex in an untreated biological sample, wherein a 
difference in the amount of hybridization complex in the treated biological sample is 
indicative of toxicity of the test compound. 

30. A method for a diagnostic test for a condition or disease associated with the expression of 
NAAP in a biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 11, under conditions suitable 
for the antibody to bind the polypeptide and form an antibody:polypeptide complex, 
and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 

31. The antibody of claim 11, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab')2 fragment, or 

e) a humanized antibody. 

32. A composition comprising an antibody of claim 11 and an acceptable excipient. 

33. A method of diagnosing a condition or disease associated with the expression of NAAP 
in a subject, comprising administering to said subject an effective amount of the composition of claim 
32. 

34. A composition of claim 32, further comprising a label. 

35. A method of diagnosing a condition or disease associated with the expression of NAAP 
in a subject, comprising administering to said subject an effective amount of the composition of claim 
34. 

36. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 
11, the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-57, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibodies from the animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal 
antibody which specifically binds to a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-57. 

37. A polyclonal antibody produced by a method of claim 36. 

38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of claim 
11, the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NO:1-57, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

e) isolating from the culture monoclonal antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO:1-57. 

40. A monoclonal antibody produced by a method of claim 39. 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 

42. The antibody of claim 11, wherein the antibody is produced by screening a Fab expression 
library. 

43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO:1-57 in a sample, the method comprising: 

a) incubating the antibody of claim 11 with the sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO:1-57 in the sample. 

45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO:1-57 from a sample, the method comprising: 

a) incubating the antibody of claim 11 with the sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO:1-57. 



46. A microarray wherein at least one element of the microarray is a polynucleotide of claim 
13. 

47. A method of generating an expression profile of a sample which contains polynucleotides, 
the method comprising: 

a) labeling the polynucleotides of the sample, 

b) contacting the elements of the microarray of claim 46 with the labeled polynucleotides 
of the sample under conditions suitable for the formation of a hybridization complex, 
and 

c) quantifying the expression of the polynucleotides in the sample. 

48. An array comprising different nucleotide molecules affixed in distinct physical locations 
on a solid substrate, wherein at least one of said nucleotide molecules comprises a first oligonucleotide 
or polynucleotide sequence specifically hybridizable with at least 30 contiguous nucleotides of a target 
polynucleotide, and wherein said target polynucleotide is a polynucleotide of claim 12. 

49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 

50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 60 contiguous nucleotides of said target polynucleotide. 

51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to said target polynucleotide. 

52. An array of claim 48, which is a microarray. 

53. An array of claim 48, further comprising said target polynucleotide hybridized to a 
nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 

55. An array of claim 48, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct physical 
location have the same sequence, and each distinct physical location on the substrate contains 
nucleotide molecules having a sequence which differs from the sequence of nucleotide molecules at 
another distinct physical location on the substrate. 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:1. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 

60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:10. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:11. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:12. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:14. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:18. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

80. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

81. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

82. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

83. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

84. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

85. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 

86. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:31. 

87. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:32. 

88. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:33. 

89. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:34. 

90. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:35. 

91. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:36. 

92. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:37. 

93. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:38. 

94. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:39. 

95. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:40. 

96. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:41. 

97. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:42. 

98. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:43. 

99. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:44. 

100. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:45. 

101. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:46. 

102. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:47. 

103. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:48. 

104. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:49. 

105. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:50. 

106. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:51. 

107. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:52. 

108. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:53. 

109. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:54. 

110. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:55. 

111. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:56. 

112. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:57. 

113. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:58. 

114. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:59. 

115. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:60. 

116. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:61. 

117. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:62. 

118. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:63. 

119. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:64. 

120. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:65. 

121. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:66. 

122. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:67. 

123. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:68. 

124. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:69. 

125. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:70. 

126. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:71. 

127. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:72. 

128. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:73. 

129. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:74. 

130. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:75. 

131. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:76. 

132. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:77. 

133. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:78. 

134. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:79. 

135. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:80. 

136. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:81. 

137. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:82. 

138. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:83. 

139. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:84. 

140. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:85. 

141. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:86. 

142. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:87. 

143. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:88. 

144. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:89. 

145. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:90. 

146. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:91. 

147. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:92. 

148. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:93. 

149. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:94. 

150. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:95. 

151. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:96. 

152. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:97. 

153. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:98. 

154. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:99. 

155. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:100. 

156. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:101. 

157. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:102. 

158. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:103. 

159. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:104. 

160. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:105. 

161. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:106. 

162. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:107. 

163. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:108. 

164. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:109. 

165. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:110. 

166. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:111. 

167. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
5 NO:112. 

168. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:113. 

10 169. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO: 114. 
<110> INCYTE GENOMICS, INC.
AZIMZAI, Yalda
BAUGHN, Mariah R.
BECHA, Shanya D.
BOROWSKY, Mark L.
BURFORD, Neil
ELLIOTT, Vicki S.
EMERLING, Brooke M.
FORSYTHE, Ian J.
GORVAD, Ann E. 
GRIFFIN, Jennifer A. 
KABLE, Amy E.
KHARE, Reena 
LAL, Preeti G. 
LEE, Ernestine A. 
LEE, Soo Yeun 
LI, Joana X. 
MARQUIS, Joseph P. 
RAMKUMAR, Jayalaxmi 
RICHARDSON, Thomas W. 
SPRAGUE, William W.
SWARNAKER, Anita 
TANG, Y. Tom 
CHAWLA, Narinder K. 
WARREN, Bridget A. 
YUE, Henry 

<120> NUCLEIC ACID-ASSOCIATED PROTEINS 

<130> PF-1338 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> US 60/343,004 
<151> 2001-12-19 

<150> US 60/347,633 
<151> 2002-01-11 

<150> US 60/351,749 
<151> 2002-01-25 

<150> US 60/359,498 
<151> 2002-02-22 

<160> 114 

<170> PERL Program 

<210> 1 
<211> 709 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature
<223> Incyte ID No: 7506140CD1
<400> 1 

Met Ala Ala Asp Ser Arg Glu Glu Lys Asp Gly Glu Leu Asn Val
1 5 10 15

Leu Asp Asp Ile Leu Thr Glu Val Pro Glu Gln Asp Asp Glu Leu
20 25 30

Tyr Asn Pro Glu Ser Glu Gln Asp Lys Asn Glu Lys Lys Gly Ser
35 40 45

Lys Arg Lys Ser Asp Arg Met Glu Ser Thr Asp Thr Lys Arg Gln
50 55 60

Lys Pro Ser Val His Ser Arg Gln Leu Val Ser Lys Pro Leu Ser
65 70 75

Ser Ser Val Ser Asn Asn Lys Arg Ile Val Ser Thr Lys Gly Lys
80 85 90

Ser Ala Thr Glu Tyr Lys Asn Glu Glu Tyr Gln Arg Ser Glu Arg
95 100 105

Asn Lys Arg Leu Asp Ala Asp Arg Lys Ile Arg Leu Ser Ser Ser
110 115 120

Ala Ser Arg Glu Pro Tyr Lys Asn Gln Pro Glu Lys Thr Cys Val
125 130 135

Arg Lys Arg Asp Pro Glu Arg Arg Ala Lys Ser Pro Thr Pro Asp
140 145 150

Gly Ser Glu Arg Ile Gly Leu Glu Val Asp Arg Arg Ala Ser Arg
155 160 165

Ser Ser Gln Ser Ser Lys Glu Glu Val Asn Ser Glu Glu Tyr Gly
170 175 180

Ser Asp His Glu Thr Gly Ser Ser Gly Ser Ser Asp Glu Gln Gly
185 190 195

Asn Asn Thr Glu Asn Glu Glu Glu Gly Val Glu Glu Asp Val Glu
200 205 210

Glu Asp Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Tyr Glu
215 220 225

Gln Asp Glu Arg Asp Gln Lys Glu Glu Gly Asn Asp Tyr Asp Thr
230 235 240

Arg Ser Glu Ala Ser Asp Ser Gly Ser Glu Ser Val Ser Phe Thr
245 250 255

Asp Gly Ser Val Arg Ser Gly Ser Gly Thr Asp Gly Ser Asp Glu
260 265 270

Lys Lys Lys Glu Arg Lys Arg Ala Arg Gly Ile Ser Pro Ile Val
275 280 285

Phe Asp Arg Ser Gly Ser Ser Ala Ser Glu Ser Tyr Ala Gly Ser
290 295 300

Glu Lys Lys His Glu Lys Leu Ser Ser Ser Val Arg Ala Val Arg
305 310 315

Lys Asp Gln Thr Ser Lys Leu Lys Tyr Val Leu Gln Asp Ala Arg
320 325 330

Phe Phe Leu Ile Lys Ser Asn Asn His Glu Asn Val Ser Leu Ala
335 340 345

Lys Ala Lys Gly Val Trp Ser Thr Leu Pro Val Asn Glu Lys Lys
350 355 360

Leu Asn Leu Ala Phe Arg Ser Ala Arg Ser Val Ile Leu Ile Phe
365 370 375

Ser Val Arg Glu Ser Gly Lys Phe Gln Gly Phe Ala Arg Leu Ser
380 385 390

Ser Glu Ser His His Gly Gly Ser Pro Ile His Trp Val Leu Pro
395 400 405

Ala Gly Met Ser Ala Lys Met Leu Gly Gly Val Phe Lys Ile Asp
410 415 420

Trp Ile Cys Arg Arg Glu Leu Pro Phe Thr Lys Ser Ala His Leu
425 430 435

Thr Asn Pro Trp Asn Glu His Lys Pro Val Lys Ile Gly Arg Asp
440 445 450

Gly Gln Glu Ile Glu Leu Glu Cys Gly Thr Gln Leu Cys Leu Leu
455 460 465

Phe Pro Pro Asp Glu Ser Ile Asp Leu Tyr Gln Val Ile His Lys
470 475 480

Met Arg His Lys Arg Arg Met His Ser Gln Pro Arg Ser Arg Gly
485 490 495

Arg Pro Ser Arg Arg Glu Pro Val Arg Asp Val Gly Arg Arg Arg
500 505 510

Pro Glu Asp Tyr Asp Ile His Asn Ser Arg Lys Lys Pro Arg Ile
515 520 525

Asp Tyr Pro Pro Glu Phe His Gln Arg Pro Gly Tyr Leu Lys Asp
530 535 540

Pro Arg Tyr Gln Glu Val Asp Ser Phe Thr Asn Leu Ile Pro Asn
545 550 555

Arg Arg Phe Ser Gly Val Arg Arg Asp Val Phe Leu Asn Gly Ser
560 565 570

Tyr Asn Asp Tyr Val Arg Glu Phe His Asn Met Gly Pro Pro Pro
575 580 585

Pro Trp Gln Gly Met Pro Pro Tyr Pro Gly Met Glu Gln Pro Pro
590 595 600

His His Pro Tyr Tyr Gln His His Ala Pro Pro Pro Gln Ala His
605 610 615

Pro Pro Tyr Ser Gly His His Pro Val Pro His Glu Ala Arg Tyr
620 625 630

Arg Asp Lys Arg Val His Asp Tyr Asp Met Arg Val Asp Asp Phe
635 640 645

Leu Arg Arg Thr Gln Ala Val Val Ser Gly Arg Arg Ser Arg Pro
650 655 660

Arg Glu Arg Asp Arg Glu Arg Glu Arg Asp Arg Pro Arg Asp Asn
665 670 675

Arg Arg Asp Arg Glu Arg Asp Arg Gly Arg Asp Arg Glu Arg Glu
680 685 690

Arg Glu Arg Leu Cys Asp Arg Asp Arg Asp Arg Gly Glu Arg Gly
695 700 705

Arg Tyr Arg Arg



<210> 2 

<211> 986 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 1889415CD1 

<400> 2 

Met Ser Phe Ala Leu Glu Glu Thr Leu Glu Ser Asp Trp Val Ala
1 5 10 15

Val Arg Pro His Val Phe Asp Glu Arg Glu Lys His Lys Phe Val
20 25 30

Phe Ile Val Ala Trp Asn Glu Ile Glu Gly Lys Phe Ala Ile Thr
35 40 45

Cys His Asn Arg Thr Ala Gln Arg Gln Arg Ser Gly Ser Arg Glu
50 55 60

Gln Val Gly Ala Arg Gly Gly Ala Glu Ala Gly Gly Ala Ala Ser
65 70 75

Asp Gly Ser Arg Gly Pro Gly Ser Pro Ala Gly Arg Gly Arg Pro
80 85 90

Glu Ala Thr Ala Ser Ala Thr Leu Val Arg Ser Pro Gly Pro Arg
95 100 105

Arg Ser Ser Ala Trp Ala Glu Gly Gly Ser Pro Arg Ser Thr Arg
110 115 120

Ser Leu Leu Gly Asp Pro Arg Leu Arg Ser Pro Gly Ser Lys Gly
125 130 135

Ala Glu Ser Arg Leu Arg Ser Pro Val Arg Ala Lys Pro Ile Pro
140 145 150

Gly Gln Lys Thr Ser Glu Ala Asp Asp Ala Ala Gly Ala Ala Ala
155 160 165

Ala Ala Ala Arg Pro Ala Pro Arg Glu Ala Gln Val Ser Ser Val
170 175 180

Arg Ile Val Ser Ala Ser Gly Thr Val Ser Glu Glu Ile Glu Val
185 190 195

Leu Glu Met Val Lys Glu Asp Glu Ala Pro Leu Ala Leu Ser Asp
200 205 210

Ala Glu Gln Pro Pro Pro Ala Thr Glu Leu Glu Ser Pro Ala Glu
215 220 225

Glu Cys Ser Trp Ala Gly Leu Phe Ser Phe Gln Asp Leu Arg Ala
230 235 240

Val His Gln Gln Leu Cys Ser Val Asn Ser Gln Leu Glu Pro Cys
245 250 255

Leu Pro Val Phe Pro Glu Glu Pro Ser Gly Met Trp Thr Val Leu
260 265 270

Phe Gly Gly Ala Pro Glu Met Thr Glu Gln Glu Ile Asp Thr Leu
275 280 285

Cys Tyr Gln Leu Gln Val Tyr Leu Gly His Gly Leu Asp Thr Cys
290 295 300

Gly Trp Lys Ile Leu Ser Gln Val Leu Phe Thr Glu Thr Asp Asp
305 310 315

Pro Glu Glu Tyr Tyr Glu Ser Leu Ser Glu Leu Arg Gln Lys Gly
320 325 330

Tyr Glu Glu Val Leu Gln Arg Ala Arg Lys Arg Ile Gln Glu Leu
335 340 345

Leu Asp Lys His Lys Asn Thr Glu Ser Met Val Glu Leu Leu Asp
350 355 360

Leu Tyr Gln Leu Glu Asp Glu Ala Tyr Ser Ser Leu Ala Glu Ala
365 370 375

Thr Thr Glu Leu Tyr Gln Tyr Leu Leu Gln Pro Phe Arg Asp Met
380 385 390

Arg Glu Leu Ala Met Leu Arg Arg Gln Gln Ile Lys Ile Ser Met
395 400 405

Glu Asn Asp Tyr Leu Gly Pro Arg Arg Ile Glu Ser Leu Gln Lys
410 415 420

Glu Asp Ala Asp Trp Gln Arg Lys Ala His Met Ala Val Leu Ser
425 430 435

Ile Gln Asp Leu Thr Val Lys Tyr Phe Glu Ile Thr Ala Lys Ala
440 445 450

Gln Lys Ala Val Tyr Asp Arg Met Arg Ala Asp Gln Lys Lys Phe
455 460 465

Gly Lys Ala Ser Trp Ala Ala Ala Ala Glu Arg Met Glu Lys Leu
470 475 480

Gln Tyr Ala Val Ser Lys Glu Thr Leu Gln Met Met Arg Ala Lys
485 490 495

Glu Ile Cys Leu Glu Gln Arg Lys His Ala Leu Lys Glu Glu Met
500 505 510

Gln Ser Leu Arg Gly Gly Thr Glu Ala Ile Ala Arg Leu Asp Gln
515 520 525

Leu Glu Ala Asp Tyr Tyr Asp Leu Gln Leu Gln Leu Tyr Glu Val
530 535 540

Gln Phe Glu Ile Leu Lys Cys Glu Glu Leu Leu Leu Thr Ala Gln
545 550 555

Leu Glu Ser Ile Lys Arg Leu Ile Ser Glu Lys Arg Asp Glu Val
560 565 570

Val Tyr Tyr Asp Thr Tyr Glu Ser Met Glu Ala Met Leu Glu Lys
575 580 585

Glu Glu Met Ala Ala Ser Ala Tyr Leu Gln Arg Glu Glu Leu Gln
590 595 600

Lys Leu Gln Gln Lys Ala Arg Gln Leu Glu Ala Arg Arg Gly Arg
605 610 615

Val Ser Ala Lys Lys Ser Tyr Leu Arg Asn Lys Lys Glu Ile Cys
620 625 630

Ile Ala Lys His Asn Glu Ile Gln Gln Arg Thr Arg Ile Glu Asp
635 640 645

Glu Tyr Arg Thr His His Thr Val Gln Leu Arg Glu Lys Leu His
650 655 660

Asp Glu Glu Glu Arg Lys Ser Ala Trp Val Ser Gln Glu Arg Gln
665 670 675

Arg Thr Leu Asp Arg Leu Arg Thr Phe Lys Gln Arg Tyr Pro Gly
680 685 690

Gln Val Ile Leu Lys Ser Thr Arg Leu Arg Leu Ala His Ala Arg
695 700 705

Arg Lys Gly Ala Ala Ser Pro Val Leu Gln Glu Asp His Cys Asp
710 715 720

Ser Leu Pro Ser Val Leu Gln Val Glu Glu Lys Thr Glu Glu Val
725 730 735

Gly Glu Gly Arg Val Lys Arg Gly Pro Ser Gln Thr Thr Glu Pro
740 745 750

Gln Ser Leu Val Gln Leu Glu Asp Thr Ser Leu Thr Gln Leu Glu
755 760 765

Ala Thr Ser Leu Pro Leu Ser Gly Val Thr Ser Glu Leu Pro Pro
770 775 780

Thr Ile Ser Leu Pro Leu Leu Asn Asn Asn Leu Glu Pro Cys Ser
785 790 795

Val Thr Ile Asn Pro Leu Pro Ser Pro Leu Pro Pro Thr Pro Pro
800 805 810

Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Leu Pro
815 820 825

Val Ala Lys Asp Ser Gly Pro Glu Thr Leu Glu Lys Asp Leu Pro
830 835 840

Arg Lys Glu Gly Asn Asp Lys Arg Ile Pro Lys Ser Ala Ser Ala
845 850 855

Pro Ser Ala His Leu Phe Asp Ser Ser Gln Leu Val Ser Ala Arg
860 865 870

Lys Lys Leu Arg Lys Thr Ala Glu Gly Leu Gln Arg Arg Arg Val
875 880 885

Ser Ser Pro Met Asp Glu Val Leu Ala Ser Leu Lys Arg Gly Ser
890 895 900

Phe His Leu Lys Lys Val Glu Gln Arg Thr Leu Pro Pro Phe Pro
905 910 915

Asp Glu Asp Asp Ser Asn Asn Ile Leu Ala Gln Ile Arg Lys Gly
920 925 930

Val Lys Leu Lys Lys Val Gln Lys Asp Val Leu Arg Glu Ser Phe
935 940 945

Thr Leu Leu Pro Asp Thr Asp Pro Leu Thr Arg Ser Ile His Glu
950 955 960

Ala Leu Arg Arg Ile Lys Glu Ala Ser Pro Glu Ser Glu Asp Glu
965 970 975

Glu Glu Ala Leu Pro Cys Thr Asp Trp Glu Asn
980 985

<210> 3 

<211> 714 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7506047CD1 

<400> 3 

Met Pro Asp His Asp Ser Thr Ala Leu Leu Ser Arg Gln Thr Lys
1 5 10 15

Arg Arg Arg Val Asp Ile Gly Val Lys Arg Thr Val Gly Thr Ala
20 25 30

Ser Ala Phe Phe Ala Lys Ala Arg Ala Thr Phe Phe Ser Ala Met
35 40 45

Asn Pro Gln Gly Ser Glu Gln Asp Val Glu Tyr Ser Val Val Gln
50 55 60

His Ala Asp Gly Glu Lys Ser Asn Val Leu Arg Lys Leu Leu Lys
65 70 75

Asn Asn Met Asn Lys Asn Gly Gly Thr Glu Pro Ser Phe Gln Ala
80 85 90

Ser Gly Leu Ser Ser Thr Gly Ser Glu Val His Gln Glu Asp Ile
95 100 105

Cys Ser Asn Ser Ser Arg Asp Ser Pro Pro Glu Cys Leu Ser Pro
110 115 120

Phe Gly Arg Pro Thr Met Ser Gln Phe Asp Met Asp Arg Leu Cys
125 130 135

Asp Glu His Leu Arg Ala Lys Arg Ala Arg Val Glu Asn Ile Ile
140 145 150

Arg Gly Met Ser His Ser Pro Ser Val Ala Leu Arg Gly Asn Glu
155 160 165

Asn Glu Arg Glu Met Ala Pro Gln Ser Val Ser Pro Arg Glu Ser
170 175 180

Tyr Arg Glu Asn Lys Arg Lys Gln Lys Leu Pro Gln Gln Gln Gln
185 190 195

Gln Ser Phe Gln Gln Leu Val Ser Ala Arg Lys Glu Gln Lys Arg
200 205 210

Glu Glu Arg Arg Gln Leu Lys Gln Gln Leu Glu Asp Met Gln Lys
215 220 225

Gln Leu Arg Gln Leu Gln Glu Lys Phe Tyr Gln Ile Tyr Asp Ser
230 235 240

Thr Asp Ser Glu Asn Asp Glu Asp Gly Asn Leu Ser Glu Asp Ser
245 250 255

Met Arg Ser Glu Ile Leu Asp Ala Arg Ala Gln Asp Ser Val Gly
260 265 270

Arg Ser Asp Asn Glu Met Cys Glu Leu Asp Pro Gly Gln Phe Ile
275 280 285

Asp Arg Ala Arg Ala Leu Ile Arg Glu Gln Glu Met Ala Glu Asn
290 295 300

Lys Pro Lys Arg Glu Gly Asn Asn Lys Glu Arg Asp His Gly Pro
305 310 315

Asn Ser Leu Gln Pro Glu Gly Lys His Leu Ala Glu Thr Leu Lys
320 325 330

Gln Glu Leu Asn Thr Ala Met Ser Gln Val Val Asp Thr Val Val
335 340 345

Lys Val Phe Ser Ala Lys Pro Ser Arg Gln Val Pro Gln Val Phe
350 355 360

Pro Pro Leu Gln Ile Pro Gln Ala Arg Phe Ala Val Asn Gly Glu
365 370 375

Asn His Asn Phe His Thr Ala Asn Gln Arg Leu Gln Cys Phe Gly
380 385 390

Asp Val Ile Ile Pro Asn Pro Leu Asp Thr Phe Gly Asn Val Gln
395 400 405

Met Ala Ser Ser Thr Asp Gln Thr Glu Ala Leu Pro Leu Val Val
410 415 420

Arg Lys Asn Ser Ser Asp Gln Ser Ala Ser Gly Pro Ala Ala Gly
425 430 435

Gly His His Gln Pro Leu His Gln Ser Pro Leu Ser Ala Thr Thr
440 445 450

Gly Phe Thr Thr Ser Thr Phe Arg His Pro Phe Pro Leu Pro Leu
455 460 465

Met Ala Tyr Pro Phe Gln Ser Pro Leu Gly Ala Pro Ser Gly Ser
470 475 480

Phe Ser Gly Lys Asp Arg Ala Ser Pro Glu Ser Leu Asp Leu Thr
485 490 495

Arg Asp Thr Thr Ser Leu Arg Thr Lys Met Ser Ser His His Leu
500 505 510

Ser His His Pro Cys Ser Pro Ala His Pro Pro Ser Thr Ala Glu
515 520 525

Gly Leu Ser Leu Ser Leu Ile Lys Ser Glu Cys Gly Asp Leu Gln
530 535 540

Asp Met Ser Glu Ile Ser Pro Tyr Ser Gly Ser Ala Met Gln Glu
545 550 555

Gly Leu Ser Pro Asn His Leu Lys Lys Ala Lys Leu Met Phe Phe
560 565 570

Tyr Thr Arg Tyr Pro Ser Ser Asn Met Leu Lys Thr Tyr Phe Ser
575 580 585

Asp Val Lys Phe Asn Arg Cys Ile Thr Ser Gln Leu Ile Lys Trp
590 595 600

Phe Ser Asn Phe Arg Glu Phe Tyr Tyr Ile Gln Met Glu Lys Tyr
605 610 615

Ala Arg Gln Ala Ile Asn Asp Gly Val Thr Ser Thr Glu Glu Leu
620 625 630

Ser Ile Thr Arg Asp Cys Glu Leu Tyr Arg Ala Leu Asn Met His
635 640 645

Tyr Asn Lys Ala Asn Asp Phe Glu Val Pro Glu Arg Phe Leu Glu
650 655 660

Val Ala Gln Ile Thr Leu Arg Glu Phe Phe Asn Ala Ile Ile Ala
665 670 675

Gly Lys Asp Val Asp Pro Ser Trp Lys Lys Ala Ile Tyr Lys Val
680 685 690

Ile Cys Lys Leu Asp Ser Glu Val Pro Glu Ile Phe Lys Ser Pro
695 700 705

Asn Cys Leu Gln Glu Leu Leu His Glu
710

<210> 4 
<211> 166 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7505849CD1 

<400> 4 



Met Ser Lys Leu Gly Arg Ala Ala Arg Gly Leu Arg Lys Pro Glu
1 5 10 15

Arg Gly Val Ser Ile Asn Gln Phe Cys Lys Glu Phe Asn Glu Arg
20 25 30

Thr Lys Asp Ile Lys Glu Gly Ile Pro Leu Pro Thr Lys Ile Leu
35 40 45

Val Lys Pro Asp Arg Thr Phe Glu Ile Lys Ile Gly Gln Pro Thr
50 55 60

Val Ser Tyr Phe Leu Lys Ala Ala Ala Gly Ile Glu Lys Gly Ala
65 70 75

Arg Gln Thr Gly Lys Glu Val Ala Gly Leu Val Thr Leu Lys His
80 85 90

Val Tyr Glu Ile Ala Arg Ile Lys Ala Gln Asp Glu Ala Phe Ala
95 100 105

Leu Gln Asp Val Pro Leu Ser Ser Val Val Arg Ser Ile Ile Gly
110 115 120

Ser Ala Arg Ser Leu Gly Ile Arg Val Val Lys Asp Leu Ser Ser
125 130 135

Glu Glu Leu Ala Ala Phe Gln Lys Glu Arg Ala Ile Phe Leu Ala
140 145 150

Ala Gln Lys Glu Ala Asp Leu Ala Ala Gln Glu Glu Ala Ala Lys
155 160 165

Lys



<210> 5 
<211> 142 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature
<223> Incyte ID No: 7505972CD1
<400> 5 

Met Ala Ser Asp Glu Gly Lys Leu Phe Val Gly Gly Leu Ser Phe
1 5 10 15

Asp Thr Asn Glu Gln Ser Leu Glu Gln Val Phe Ser Lys Tyr Gly
20 25 30

Gln Ile Ser Glu Val Val Val Val Lys Asp Arg Glu Thr Gln Arg
35 40 45

Ser Arg Gly Phe Gly Phe Val Thr Phe Glu Asn Ile Asp Asp Ala
50 55 60

Lys Asp Ala Met Met Ala Met Asn Gly Lys Ser Val Asp Gly Arg
65 70 75

Gln Ile Arg Gly Arg Gly Arg Gly Phe Ser Arg Gly Gly Gly Asp
80 85 90

Arg Gly Tyr Gly Gly Asn Arg Phe Glu Ser Arg Ser Gly Gly Tyr
95 100 105

Gly Gly Ser Arg Asp Tyr Tyr Ser Ser Arg Ser Gln Ser Gly Gly
110 115 120

Tyr Ser Asp Arg Ser Ser Gly Gly Ser Tyr Arg Asp Ser Tyr Asp
125 130 135

Ser Tyr Ala Thr His Asn Glu
140

<210> 6 

<211> 317 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7505991CD1 

<400> 6 

Met Glu Val Leu Arg Pro Gln Leu Ile Arg Ile Asp Gly Arg Asn
1 5 10 15

Tyr Arg Lys Asn Pro Val Gln Glu Gln Thr Tyr Gln His Glu Glu
20 25 30

Asp Glu Glu Asp Phe Tyr Gln Gly Ser Met Glu Cys Ala Asp Glu
35 40 45

Pro Cys Asp Ala Tyr Glu Val Glu Gln Thr Pro Gln Gly Phe Arg
50 55 60

Ser Thr Leu Arg Ala Pro Ser Leu Leu Tyr Lys His Ile Val Gly
65 70 75

Lys Arg Gly Asp Thr Arg Lys Lys Ile Glu Met Glu Thr Lys Thr
80 85 90

Ser Ile Ser Ile Pro Lys Pro Gly Gln Asp Gly Glu Ile Val Ile
95 100 105

Thr Gly Gln His Arg Asn Gly Val Ile Ser Ala Arg Thr Arg Ile
110 115 120

Asp Val Leu Leu Asp Thr Phe Arg Arg Lys Gln Pro Phe Thr His
125 130 135

Phe Leu Ala Phe Phe Leu Asn Glu Val Glu Val Gln Glu Gly Phe
140 145 150

Leu Arg Phe Gln Glu Glu Val Leu Ala Lys Cys Ser Met Asp His
155 160 165

Gly Val Asp Ser Ser Ile Phe Gln Asn Pro Lys Lys Leu His Leu
170 175 180

Thr Ile Gly Met Leu Val Leu Leu Ser Glu Glu Glu Ile Gln Gln
185 190 195

Thr Cys Glu Met Leu Gln Gln Cys Lys Glu Glu Phe Ile Lys Leu
200 205 210

Gln Glu Leu Val Asp Arg Val Leu Glu Arg Phe Gln Ala Ser Gly
215 220 225

Leu Ile Val Lys Glu Trp Asn Ser Val Lys Leu His Ala Thr Val
230 235 240

Met Asn Thr Leu Phe Arg Lys Asp Pro Asn Ala Glu Gly Arg Tyr
245 250 255

Asn Leu Tyr Thr Ala Glu Gly Lys Tyr Ile Phe Lys Glu Arg Glu
260 265 270

Ser Phe Asp Gly Arg Asn Ile Leu Lys Leu Phe Glu Asn Phe Tyr
275 280 285

Phe Gly Ser Leu Lys Leu Asn Ser Ile His Ile Ser Gln Arg Phe
290 295 300

Thr Val Asp Ser Phe Gly Asn Tyr Ala Ser Cys Gly Gln Ile Asp
305 310 315

Phe Ser



<210> 7 

<211> 1359 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7506003CD1 



<400> 7 
Met Val Lys Glu Gln Phe Arg Glu Thr Asp Val Ala Lys Lys Ile
1 5 10 15

Ser His Ile Cys Phe Gly Met Lys Ser Pro Glu Glu Met Arg Gln
20 25 30

Gln Ala His Ile Gln Val Val Ser Lys Asn Leu Tyr Ser Gln Asp
35 40 45

Asn Gln His Ala Pro Leu Leu Tyr Gly Val Leu Asp His Arg Met
50 55 60

Gly Thr Ser Glu Lys Asp Arg Pro Cys Glu Thr Cys Gly Lys Asn
65 70 75

Leu Ala Asp Cys Leu Gly His Tyr Gly Tyr Ile Asp Leu Glu Leu
80 85 90

Pro Cys Phe His Val Gly Tyr Phe Arg Ala Val Ile Gly Ile Leu
95 100 105

Gln Met Ile Cys Lys Thr Cys Cys His Ile Met Leu Ser Gln Glu
110 115 120

Glu Lys Lys Gln Phe Leu Asp Tyr Leu Lys Arg Pro Gly Leu Thr
125 130 135

Tyr Leu Gln Lys Arg Gly Leu Lys Lys Lys Ile Ser Asp Lys Cys
140 145 150

Arg Lys Lys Asn Ile Cys His His Cys Gly Ala Phe Asn Gly Thr
155 160 165

Val Lys Lys Cys Gly Leu Leu Lys Ile Ile His Glu Lys Tyr Lys
170 175 180


Thr Asn Lys Lys Val Val Asp Pro Ile Val Ser Asn Phe Leu Gln
185 190 195

Ser Phe Glu Thr Ala Ile Glu His Asn Lys Glu Val Glu Pro Leu
200 205 210

Leu Gly Arg Ala Gln Glu Asn Leu Asn Pro Leu Val Val Leu Asn
215 220 225

Leu Phe Lys Arg Ile Pro Ala Glu Asp Val Pro Leu Leu Leu Met
230 235 240

Asn Pro Glu Ala Gly Lys Pro Ser Asp Leu Ile Leu Thr Arg Leu
245 250 255

Leu Val Pro Pro Leu Cys Phe Arg Pro Ser Val Val Ser Asp Leu
260 265 270

Lys Ser Gly Thr Asn Glu Asp Asp Leu Thr Met Lys Leu Thr Glu
275 280 285

Ile Ile Phe Leu Asn Asp Val Ile Lys Lys His Arg Ile Ser Gly
290 295 300

Ala Lys Thr Gln Met Ile Met Glu Asp Trp Asp Phe Leu Gln Leu
305 310 315

Gln Cys Ala Leu Tyr Ile Asn Ser Glu Leu Ser Gly Ile Pro Leu
320 325 330

Asn Met Ala Pro Lys Lys Trp Thr Arg Gly Phe Val Gln Arg Leu
335 340 345

Lys Gly Lys Gln Gly Arg Phe Arg Gly Asn Leu Ser Gly Lys Arg
350 355 360

Val Asp Phe Ser Gly Arg Thr Val Ile Ser Pro Asp Pro Asn Leu
365 370 375

Arg Ile Asp Glu Val Ala Val Pro Val His Val Ala Lys Ile Leu
380 385 390

Thr Phe Pro Glu Lys Val Asn Lys Ala Asn Ile Asn Phe Leu Arg
395 400 405

Lys Leu Val Gln Asn Gly Pro Glu Val His Pro Gly Ala Asn Phe
410 415 420

Ile Gln Gln Arg His Thr Gln Met Lys Arg Phe Leu Lys Tyr Gly
425 430 435

Asn Arg Glu Lys Met Ala Gln Glu Leu Lys Tyr Gly Asp Ile Val
440 445 450

Glu Arg His Leu Ile Asp Gly Asp Val Val Leu Phe Asn Arg Gln
455 460 465

Pro Ser Leu His Lys Leu Ser Ile Met Ala His Leu Ala Arg Val
470 475 480

Lys Pro His Arg Thr Phe Arg Phe Asn Glu Cys Val Cys Thr Pro
485 490 495

Tyr Asn Ala Asp Phe Asp Gly Asp Glu Met Asn Leu His Leu Pro
500 505 510

Gln Thr Glu Glu Ala Lys Ala Glu Ala Leu Val Leu Met Gly Thr
515 520 525

Lys Ala Asn Leu Val Thr Pro Arg Asn Gly Glu Pro Leu Ile Ala
530 535 540

Ala Ile Gln Asp Phe Leu Thr Gly Ala Tyr Leu Leu Thr Leu Lys
545 550 555

Asp Thr Phe Phe Asp Arg Ala Lys Ala Cys Gln Ile Ile Ala Ser
560 565 570

Ile Leu Val Gly Lys Asp Glu Lys Ile Lys Val Arg Leu Pro Pro
575 580 585

Pro Thr Ile Leu Lys Pro Val Thr Leu Trp Thr Gly Lys Gln Ile
590 595 600


Phe Ser Val Ile Leu Arg Pro Ser Asp Asp Asn Pro Val Arg Ala
605 610 615

Asn Leu Arg Thr Lys Gly Lys Gln Tyr Cys Gly Lys Gly Glu Asp
620 625 630

Leu Cys Ala Asn Asp Ser Tyr Val Thr Ile Gln Asn Ser Glu Leu
635 640 645

Met Ser Gly Ser Met Asp Lys Gly Thr Leu Gly Ser Gly Ser Lys
650 655 660

Asn Asn Ile Phe Tyr Ile Leu Leu Arg Asp Trp Gly Gln Asn Glu
665 670 675

Ala Ala Asp Ala Met Ser Arg Leu Ala Arg Leu Ala Pro Val Tyr
680 685 690

Leu Ser Asn Arg Gly Phe Ser Ile Gly Ile Gly Asp Val Thr Pro
695 700 705

Gly Gln Gly Leu Leu Lys Ala Lys Tyr Glu Leu Leu Asn Ala Gly
710 715 720

Tyr Lys Lys Cys Asp Glu Tyr Ile Glu Ala Leu Asn Thr Gly Lys
725 730 735

Leu Gln Gln Gln Pro Gly Cys Thr Ala Glu Glu Thr Leu Glu Ala
740 745 750

Leu Ile Leu Lys Glu Leu Ser Val Ile Arg Asp His Ala Gly Ser
755 760 765

Ala Cys Leu Arg Glu Leu Asp Lys Ser Asn Ser Pro Leu Thr Met
770 775 780

Ala Leu Cys Gly Ser Lys Gly Ser Phe Ile Asn Ile Ser Gln Met
785 790 795

Ile Ala Cys Val Gly Gln Gln Ala Ile Ser Gly Ser Arg Val Pro
800 805 810

Asp Gly Phe Glu Asn Arg Ser Leu Pro His Phe Glu Lys His Ser
815 820 825

Lys Leu Pro Ala Ala Lys Gly Phe Val Ala Asn Ser Phe Tyr Ser
830 835 840

Gly Leu Thr Pro Thr Glu Phe Phe Phe His Thr Met Ala Gly Arg
845 850 855

Glu Gly Leu Val Asp Thr Ala Val Lys Thr Ala Glu Thr Gly Tyr
860 865 870

Met Gln Arg Arg Leu Val Lys Ser Leu Glu Asp Leu Cys Ser Gln
875 880 885

Tyr Asp Leu Thr Val Arg Ser Ser Thr Gly Asp Ile Ile Gln Phe
890 895 900

Ile Tyr Gly Gly Asp Gly Leu Asp Pro Ala Ala Met Glu Gly Lys
905 910 915

Asp Glu Pro Leu Glu Phe Lys Arg Val Leu Asp Asn Ile Lys Ala
920 925 930

Val Phe Pro Cys Pro Ser Glu Pro Ala Leu Ser Lys Asn Glu Leu
935 940 945

Ile Leu Thr Thr Glu Ser Ile Met Lys Lys Ser Glu Phe Leu Cys
950 955 960

Cys Gln Asp Ser Phe Leu Gln Glu Ile Lys Lys Phe Ile Lys Gly
965 970 975

Val Ser Glu Lys Ile Lys Lys Thr Arg Asp Lys Tyr Gly Ile Asn
980 985 990

Asp Asn Gly Thr Thr Glu Pro Arg Val Leu Tyr Gln Leu Asp Arg
995 1000 1005

Ile Thr Pro Thr Gln Val Glu Lys Phe Leu Glu Thr Cys Arg Asp
1010 1015 1020

Lys Tyr Met Arg Ala Gln Met Glu Pro Gly Ser Ala Val Gly Ala
1025 1030 1035

Leu Cys Ala Gln Ser Ile Gly Glu Pro Gly Thr Gln Met Thr Leu
1040 1045 1050

Lys Thr Phe His Phe Ala Gly Val Ala Ser Met Asn Ile Thr Leu
1055 1060 1065

Gly Val Pro Arg Ile Lys Glu Ile Ile Asn Ala Ser Lys Ala Ile
1070 1075 1080

Ser Thr Pro Ile Ile Thr Ala Gln Leu Asp Lys Asp Asp Asp Ala
1085 1090 1095

Asp Tyr Ala Arg Leu Val Lys Gly Arg Ile Glu Lys Thr Leu Leu
1100 1105 1110

Gly Glu Val Asn Ala Glu Thr Val Arg Tyr Ser Ile Cys Thr Ser
1115 1120 1125

Lys Leu Arg Val Lys Pro Gly Asp Val Ala Val His Gly Glu Ala
1130 1135 1140

Val Val Cys Val Thr Pro Arg Glu Asn Ser Lys Ser Ser Met Tyr
1145 1150 1155

Tyr Val Leu Gln Phe Leu Lys Glu Asp Leu Pro Lys Val Val Val
1160 1165 1170

Gln Gly Ile Pro Glu Val Ser Arg Ala Val Ile His Ile Asp Glu
1175 1180 1185

Gln Ser Gly Lys Glu Lys Tyr Lys Leu Leu Val Glu Gly Asp Asn
1190 1195 1200

Leu Arg Ala Val Met Ala Thr His Gly Val Lys Gly Thr Arg Thr
1205 1210 1215

Thr Ser Asn Asn Thr Tyr Glu Val Glu Lys Thr Leu Gly Ile Glu
1220 1225 1230

Ala Ala Arg Thr Thr Ile Ile Asn Glu Ile Gln Tyr Thr Met Val
1235 1240 1245

Asn His Gly Met Ser Ile Asp Arg Arg His Val Met Leu Leu Ser
1250 1255 1260

Asp Leu Met Thr Tyr Lys Gly Glu Val Leu Gly Ile Thr Arg Phe
1265 1270 1275

Gly Leu Ala Lys Met Lys Glu Ser Val Leu Met Leu Ala Ser Phe
1280 1285 1290

Glu Lys Thr Ala Asp His Leu Phe Asp Ala Ala Tyr Phe Gly Gln
1295 1300 1305

Lys Asp Ser Val Cys Gly Val Ser Glu Cys Ile Ile Met Gly Ile
1310 1315 1320

Pro Met Asn Ile Gly Thr Gly Leu Phe Lys Leu Leu His Lys Ala
1325 1330 1335

Asp Arg Asp Pro Asn Pro Pro Lys Arg Pro Leu Ile Phe Asp Thr
1340 1345 1350

Asn Glu Phe His Ile Pro Leu Val Thr
1355



<210> 8 

<211> 226 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 6483977CD1
<400> 8



Met Ser Arg Tyr Gly Arg Tyr Gly Gly Glu Thr Lys Val Tyr Val
1 5 10 15

Gly Asn Leu Gly Thr Gly Ala Gly Lys Gly Glu Leu Glu Arg Ala
20 25 30

Phe Ser Tyr Tyr Gly Pro Leu Arg Thr Val Trp Ile Ala Arg Asn
35 40 45

Pro Pro Gly Phe Ala Phe Val Glu Phe Glu Asp Pro Arg Asp Ala
50 55 60

Glu Asp Ala Val Arg Gly Leu Asp Gly Lys Val Ile Cys Gly Ser
65 70 75

Arg Val Arg Val Glu Leu Ser Thr Gly Met Pro Arg Arg Ser Arg
80 85 90

Phe Asp Arg Pro Pro Ala Arg Arg Pro Phe Asp Pro Asn Asp Arg
95 100 105

Cys Tyr Glu Cys Gly Glu Lys Gly His Tyr Ala Tyr Asp Cys His
110 115 120

Arg Tyr Ser Arg Arg Arg Arg Ser Arg Ser Arg Ser Arg Ser His
125 130 135

Ser Arg Ser Arg Gly Arg Arg Tyr Ser Arg Ser Arg Ser Arg Ser
140 145 150

Arg Gly Arg Arg Ser Arg Ser Ala Ser Pro Arg Arg Ser Arg Ser
155 160 165

Ile Ser Leu Arg Arg Ser Arg Ser Ala Ser Leu Arg Arg Ser Arg
170 175 180

Ser Gly Ser Ile Lys Gly Ser Arg Tyr Phe Gln Ser Pro Ser Arg
185 190 195

Ser Arg Ser Arg Ser Arg Ser Ile Ser Arg Pro Arg Ser Ser Arg
200 205 210

Ser Pro Ser Gly Ser Pro Arg Arg Ser Ala Ser Pro Glu Arg Met
215 220 225



<210> 9 

<211> 76 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6301777CD1 

<400> 9 

Met Gly Leu Pro Arg Arg Ala Gly Asp Ala Ala Glu Leu Arg Lys
1 5 10 15

Ser Leu Lys Pro Leu Leu Glu Lys Arg Arg Arg Ala Arg Ile Asn
20 25 30

Gln Ser Leu Ser Gln Leu Lys Gly Leu Ile Leu Pro Leu Leu Gly
35 40 45

Arg Glu Asp Ala Ser Gly Trp His Thr Trp Leu Pro Leu His Ala
50 55 60

Gln Asn Cys Leu Leu Leu Tyr Ile Gln Ala Pro Glu Gln Pro Pro
65 70 75

Ala

<210> 10

<211> 124 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7505976CD1 

<400> 10 

Met Ala Ala Thr Glu Pro Ile Leu Ala Ala Thr Gly Ser Pro Ala
1 5 10 15

Ala Val Pro Pro Glu Lys Leu Glu Gly Ala Gly Ser Ser Ser Ala
20 25 30

Pro Glu Arg Asn Cys Val Gly Ser Ser Leu Pro Glu Ala Ser Pro
35 40 45

Pro Ala Pro Glu Pro Ser Ser Pro Asn Ala Ala Val Pro Glu Ala
50 55 60

Ile Pro Thr Pro Arg Ala Ala Ala Ser Ala Ala Leu Glu Leu Pro
65 70 75

Leu Gly Pro Ala Pro Val Ser Val Ala Pro Gln Ala Glu Ala Glu
80 85 90

Ala Arg Ser Thr Pro Gly Pro Ala Gly Ser Arg Leu Gly Pro Glu
95 100 105

Thr Phe Arg Gln Arg Phe Arg Gln Phe Arg Arg Arg Thr Asp Val
110 115 120

Arg Ile Thr Gly



<210> 11 
<211> 488 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7506016CD1

<400> 11

Met Asn Gln Ser Ile Pro Val Ala Pro Thr Pro Pro Arg Arg Val

1 5 10 15

Arg Leu Lys Pro Trp Leu Val Ala Gln Val Asn Ser Cys Gln Tyr

20 25 30

Pro Gly Leu Gln Trp Val Asn Gly Glu Lys Lys Leu Phe Cys Ile

35 40 45

Pro Trp Arg His Ala Thr Arg His Gly Pro Ser Gln Asp Gly Asp

50 55 60

Asn Thr Ile Phe Lys Ala Trp Ala Lys Glu Thr Gly Lys Tyr Thr

65 70 75

Glu Gly Val Asp Glu Ala Asp Pro Ala Lys Trp Lys Ala Asn Leu

80 85 90

Arg Cys Ala Leu Asn Lys Ser Arg Asp Phe Arg Leu Ile Tyr Asp

95 100 105

Gly Pro Arg Asp Met Pro Pro Gln Pro Tyr Lys Ile Tyr Glu Val

110 115 120

Cys Ser Asn Gly Pro Ala Pro Thr Asp Ser Gln Pro Pro Glu Asp

125 130 135

Tyr Ser Phe Gly Ala Gly Glu Glu Glu Glu Glu Glu Glu Glu Leu

140 145 150

Gln Arg Met Leu Pro Ser Leu Ser Leu Thr Glu Asp Val Lys Trp

155 160 165

Pro Pro Thr Leu Gln Pro Pro Thr Leu Gln Pro Pro Val Val Leu

170 175 180

Gly Pro Pro Ala Pro Asp Pro Ser Pro Leu Ala Pro Pro Pro Gly

185 190 195

Asn Pro Ala Gly Phe Arg Glu Leu Leu Ser Glu Val Leu Glu Pro

200 205 210

Gly Pro Leu Pro Ala Ser Leu Pro Pro Ala Gly Glu Gln Leu Leu

215 220 225

Pro Asp Leu Leu Ile Ser Pro His Met Leu Pro Leu Thr Asp Leu

230 235 240

Glu Ile Lys Phe Gln Tyr Arg Gly Arg Pro Pro Arg Ala Leu Thr

245 250 255

Ile Ser Asn Pro His Gly Cys Arg Leu Phe Tyr Ser Gln Leu Glu

260 265 270

Ala Thr Gln Glu Gln Val Glu Leu Phe Gly Pro Ile Ser Leu Glu

275 280 285

Gln Val Arg Phe Pro Ser Pro Glu Asp Ile Pro Ser Asp Lys Gln

290 295 300

Arg Phe Tyr Thr Asn Gln Leu Leu Asp Val Leu Asp Arg Gly Leu

305 310 315

Ile Leu Gln Leu Gln Gly Gln Asp Leu Tyr Ala Ile Arg Leu Cys

320 325 330

Gln Cys Lys Val Phe Trp Ser Gly Pro Cys Ala Ser Ala His Asp

335 340 345

Ser Cys Pro Asn Pro Ile Gln Arg Glu Val Lys Thr Lys Leu Phe

350 355 360

Ser Leu Glu His Phe Leu Asn Glu Leu Ile Leu Phe Gln Lys Gly

365 370 375

Gln Thr Asn Thr Pro Pro Pro Phe Glu Ile Phe Phe Cys Phe Gly

380 385 390

Glu Glu Trp Pro Asp Arg Lys Pro Arg Glu Lys Lys Leu Ile Thr

395 400 405

Val Gln Val Val Pro Val Ala Ala Arg Leu Leu Leu Glu Met Phe

410 415 420

Ser Gly Glu Leu Ser Trp Ser Ala Asp Ser Ile Arg Leu Gln Ile

425 430 435

Ser Asn Pro Asp Leu Lys Asp Arg Met Val Glu Gln Phe Lys Glu

440 445 450

Leu His His Ile Trp Gln Ser Gln Gln Arg Leu Gln Pro Val Ala

455 460 465

Gln Ala Pro Pro Gly Ala Gly Leu Gly Val Gly Gln Gly Pro Trp

470 475 480

Pro Met His Pro Ala Gly Met Gln

485 

<210> 12 

<211> 576 

<212> PRT 

<213> Homo sapiens 

<220> 
<221> misc_feature

<223> Incyte ID No: 7506086CD1

<400> 12

Met Leu Ser Leu Lys Lys Tyr Leu Thr Glu Gly Leu Leu Gln Phe

1 5 10 15

Thr Ile Leu Leu Ser Leu Ile Gly Val Arg Val Asp Val Asp Thr

20 25 30

Tyr Leu Thr Ser Gln Leu Pro Pro Leu Arg Glu Ile Ile Leu Gly

35 40 45

Pro Ser Ser Ala Tyr Thr Gln Thr Gln Phe His Asn Leu Arg Asn

50 55 60

Thr Leu Asp Gly Tyr Gly Ile His Pro Lys Ser Ile Asp Leu Asp

65 70 75

Asn Tyr Phe Thr Ala Arg Arg Leu Leu Ser Gln Val Arg Ala Leu

80 85 90

Asp Arg Phe Gln Val Pro Thr Thr Glu Val Asn Ala Trp Leu Val

95 100 105

His Arg Asp Pro Glu Gly Ser Val Ser Gly Ser Gln Pro Asn Ser

110 115 120

Gly Leu Ala Leu Glu Ser Ser Ser Gly Leu Gln Asp Val Thr Gly

125 130 135

Pro Asp Asn Gly Val Arg Glu Ser Glu Thr Glu Gln Gly Phe Gly

140 145 150

Glu Asp Leu Glu Asp Leu Gly Ala Val Ala Pro Pro Val Ser Gly

155 160 165

Asp Leu Thr Lys Glu Asp Ile Asp Leu Ile Asp Ile Leu Trp Arg

170 175 180

Gln Asp Ile Asp Leu Gly Ala Gly Arg Glu Val Phe Asp Tyr Ser

185 190 195

His Arg Gln Lys Glu Gln Asp Val Glu Lys Glu Leu Arg Asp Gly

200 205 210

Gly Glu Gln Asp Thr Trp Ala Gly Glu Gly Ala Glu Ala Leu Ala

215 220 225

Arg Asn Leu Leu Val Asp Gly Glu Thr Gly Glu Ser Phe Pro Ala

230 235 240

Gln Val Pro Ser Gly Glu Asp Gln Thr Ala Leu Ser Leu Glu Glu

245 250 255

Cys Leu Arg Leu Leu Glu Ala Thr Cys Pro Phe Gly Glu Asn Ala

260 265 270

Glu Phe Pro Ala Asp Ile Ser Ser Ile Thr Glu Ala Val Pro Ser

275 280 285

Glu Ser Glu Pro Pro Ala Leu Gln Asn Asn Leu Leu Ser Pro Leu

290 295 300

Leu Thr Gly Thr Glu Ser Pro Phe Asp Leu Glu Gln Gln Trp Gln

305 310 315

Asp Leu Met Ser Ile Met Glu Met Gln Glu Gly Ala Val Gly Tyr

320 325 330

Ser Ser Asp Ser Glu Thr Leu Asp Leu Glu Glu Ala Glu Gly Ala

335 340 345

Val Gly Tyr Gln Pro Glu Tyr Ser Lys Phe Cys Arg Met Ser Tyr

350 355 360

Gln Asp Pro Ala Gln Leu Ser Cys Leu Pro Tyr Leu Glu His Val

365 370 375

Gly His Asn His Thr Tyr Asn Met Ala Pro Ser Ala Leu Asp Ser

380 385 390

Ala Asp Leu Pro Pro Pro Ser Ala Leu Lys Lys Gly Ser Lys Glu

395 400 405

Lys Gln Ala Asp Phe Leu Asp Lys Gln Met Ser Arg Asp Glu His

410 415 420

Arg Ala Arg Ala Met Lys Ile Pro Phe Thr Asn Asp Lys Ile Ile

425 430 435

Asn Leu Pro Val Glu Glu Phe Asn Glu Leu Leu Ser Lys Tyr Gln

440 445 450

Leu Ser Glu Ala Gln Leu Ser Leu Ile Arg Asp Ile Arg Arg Arg

455 460 465

Gly Lys Asn Lys Met Ala Ala Gln Asn Cys Arg Lys Arg Lys Leu

470 475 480

Asp Thr Ile Leu Asn Leu Glu Arg Asp Val Glu Asp Leu Gln Arg

485 490 495

Asp Lys Ala Arg Leu Leu Arg Glu Lys Val Glu Phe Leu Arg Ser

500 505 510

Leu Arg Gln Met Lys Gln Lys Val Gln Ser Leu Tyr Gln Glu Val

515 520 525

Phe Gly Arg Leu Arg Asp Glu Asn Gly Arg Pro Tyr Ser Pro Ser

530 535 540

Gln Tyr Ala Leu Gln Tyr Ala Gly Asp Gly Ser Val Leu Leu Ile

545 550 555

Pro Arg Thr Met Ala Asp Gln Gln Ala Arg Arg Gln Glu Arg Lys

560 565 570 

Pro Lys Asp Arg Arg Lys 

575 

<210> 13 
<211> 573 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4176657CD1 

<400> 13 

Met Thr Pro Glu Ser Arg Asp Thr Thr Asp Leu Ser Pro Gly Gly 
1 5 10 15

Thr Gln Glu Met Glu Gly Ile Val Ile Val Lys Val Glu Glu Glu

20 25 30

Asp Glu Glu Asp His Phe Gln Lys Glu Arg Asn Lys Val Glu Ser

35 40 45

Ser Pro Gln Val Leu Ser Arg Ser Thr Thr Met Asn Glu Arg Ala

50 55 60

Leu Leu Ser Ser Tyr Leu Val Ala Tyr Arg Val Ala Lys Glu Lys

65 70 75

Met Ala His Thr Ala Ala Glu Lys Ile Ile Leu Pro Ala Cys Met

80 85 90

Asp Met Val Arg Thr Ile Phe Asp Asp Lys Ser Ala Asp Lys Leu

95 100 105

Arg Thr Ile Pro Leu Ser Asp Asn Thr Ile Ser Arg Arg Ile Cys

110 115 120

Thr Ile Ala Lys His Leu Glu Ala Met Leu Ile Thr Arg Leu Gln

125 130 135

Ser Gly Ile Asp Phe Ala Ile Gln Leu Asp Glu Ser Thr Asp Ile

140 145 150

Ala Ser Cys Pro Thr Leu Leu Val Tyr Val Arg Tyr Val Trp Gln

155 160 165

Asp Asp Phe Val Glu Asp Leu Leu Cys Cys Leu Asn Leu Asn Ser

170 175 180

His Ile Thr Gly Leu Asp Leu Phe Thr Glu Leu Glu Asn Cys Leu

185 190 195

Leu Gly Gln Tyr Lys Leu Asn Trp Lys His Cys Lys Gly Ile Ser

200 205 210

Ser Asp Gly Thr Ala Asn Met Thr Gly Lys His Ser Arg Leu Thr

215 220 225

Glu Lys Leu Leu Glu Ala Thr His Asn Asn Ala Val Trp Asn His

230 235 240

Cys Phe Ile His Arg Glu Ala Leu Val Ser Lys Glu Ile Ser Pro

245 250 255

Ser Leu Met Asp Val Leu Lys Asn Ala Val Lys Thr Val Asn Phe

260 265 270

Ile Lys Gly Ser Ser Leu Asn Ser Arg Leu Leu Glu Ile Phe Cys

275 280 285

Ser Glu Ile Gly Val Asn His Thr His Leu Leu Phe His Thr Glu

290 295 300

Val Arg Trp Leu Ser Gln Gly Lys Val Leu Ser Arg Val Tyr Glu

305 310 315

Leu Arg Asn Glu Ile Tyr Ile Phe Leu Val Glu Lys Gln Ser His

320 325 330

Leu Ala Asn Ile Phe Glu Asp Asp Ile Trp Val Thr Lys Leu Ala

335 340 345

Tyr Leu Ser Asp Ile Phe Gly Ile Leu Asn Glu Leu Ser Leu Lys

350 355 360

Met Gln Gly Lys Asn Asn Asp Ile Phe Gln Tyr Leu Glu His Ile

365 370 375

Leu Gly Phe Gln Lys Thr Leu Leu Leu Trp Gln Ala Arg Leu Lys

380 385 390

Ser Asn Arg Pro Ser Tyr Tyr Met Phe Pro Thr Leu Leu Gln His

395 400 405

Ile Glu Glu Asn Ile Ile Asn Glu Asp Cys Leu Lys Glu Ile Lys

410 415 420

Leu Glu Ile Leu Leu His Leu Thr Ser Leu Ser Gln Thr Phe Asn

425 430 435

Tyr Tyr Phe Pro Glu Glu Lys Phe Glu Ser Leu Lys Glu Asn Ile

440 445 450

Trp Met Lys Asp Pro Phe Ala Phe Gln Asn Pro Glu Ser Ile Ile

455 460 465

Glu Leu Asn Leu Glu Pro Glu Glu Glu Asn Glu Leu Leu Gln Leu

470 475 480

Ser Ser Ser Phe Thr Leu Lys Asn Tyr Tyr Lys Ile Leu Ser Leu

485 490 495

Ser Ala Phe Trp Ile Lys Ile Lys Asp Asp Phe Pro Leu Leu Ser

500 505 510

Arg Lys Ser Ile Leu Leu Leu Leu Pro Phe Thr Thr Thr Tyr Leu

515 520 525

Cys Glu Leu Gly Phe Ser Ile Leu Thr Arg Leu Lys Thr Lys Lys

530 535 540

Arg Asn Arg Leu Asn Ser Ala Pro Asp Met Arg Val Ala Leu Ser

545 550 555

Ser Cys Val Pro Asp Trp Lys Glu Leu Met Asn Arg Gln Ala His

560 565 570

Pro Ser His



<210> 14 

<211> 501 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506056CD1 

<400> 14 

Met Ala Pro Glu Arg Leu Arg Ser Arg Ala Leu Ser Ala Phe Lys 
1 5 10 15

Leu Arg Gly Leu Leu Leu Arg Gly Glu Ala Ile Lys Tyr Leu Thr

20 25 30

Glu Ala Leu Gln Ser Ile Ser Glu Leu Glu Leu Glu Asp Lys Leu

35 40 45

Glu Lys Ile Ile Asn Ala Val Glu Lys Gln Pro Leu Ser Ser Asn

50 55 60

Met Ile Glu Arg Ser Val Val Glu Ala Ala Val Gln Glu Cys Ser

65 70 75

Gln Ser Val Asp Glu Thr Ile Leu Leu Met Thr Asn His Pro Ala

80 85 90

Pro Asn Leu Phe Gly Thr Pro Arg Asp Lys Ala Glu Met Phe Arg

95 100 105

Glu Arg Tyr Thr Ile Leu His Gln Arg Thr His Arg His Glu Leu

110 115 120

Phe Thr Pro Pro Val Ile Gly Ser His Pro Asp Glu Ser Gly Ser

125 130 135

Lys Phe Gln Leu Lys Thr Ile Glu Thr Leu Leu Gly Ser Thr Thr

140 145 150

Lys Ile Gly Asp Ala Ile Val Leu Gly Met Ile Thr Gln Leu Lys

155 160 165

Glu Gly Lys Phe Phe Leu Glu Asp Pro Thr Gly Thr Val Gln Leu

170 175 180

Asp Leu Ser Lys Ala Gln Phe His Ser Gly Leu Tyr Thr Glu Ala

185 190 195

Cys Phe Val Leu Ala Glu Gly Trp Phe Glu Asp Gln Val Phe His

200 205 210

Val Asn Ala Phe Gly Phe Pro Pro Thr Glu Pro Ser Ser Thr Thr

215 220 225

Arg Ala Tyr Tyr Gly Asn Ile Asn Phe Phe Gly Gly Pro Ser Asn

230 235 240

Thr Ser Val Lys Thr Ser Ala Lys Leu Lys Gln Leu Glu Glu Glu

245 250 255

Asn Lys Asp Ala Met Phe Val Phe Leu Ser Asp Val Trp Leu Asp

260 265 270

Gln Val Glu Val Leu Glu Lys Leu Arg Ile Met Phe Ala Gly Tyr

275 280 285

Ser Pro Ala Pro Pro Thr Cys Phe Ile Leu Cys Gly Asn Phe Ser

290 295 300

Ser Ala Pro Tyr Gly Lys Asn Gln Val Gln Ala Leu Lys Asp Ser

305 310 315

Leu Lys Thr Leu Ala Asp Ile Ile Cys Glu Tyr Pro Asp Ile His

320 325 330

Gln Ser Ser Arg Phe Val Phe Val Pro Gly Pro Glu Asp Pro Gly

335 340 345

Phe Gly Ser Ile Leu Pro Arg Pro Pro Leu Ala Glu Ser Ile Thr

350 355 360

Asn Glu Phe Arg Gln Arg Val Pro Phe Ser Val Phe Thr Thr Asn

365 370 375

Pro Cys Arg Ile Gln Tyr Cys Thr Gln Glu Ile Thr Val Phe Arg

380 385 390

Glu Asp Leu Val Asn Lys Met Cys Arg Asn Cys Val Arg Phe Pro

395 400 405

Ser Ser Asn Leu Ala Ile Pro Asn His Phe Val Lys Thr Ile Leu

410 415 420

Ser Gln Gly His Leu Thr Pro Leu Pro Leu Tyr Val Cys Pro Val

425 430 435

Tyr Trp Ala Tyr Asp Tyr Ala Leu Arg Val Tyr Pro Val Pro Asp

440 445 450

Leu Leu Val Ile Ala Asp Lys Tyr Asp Pro Phe Thr Thr Thr Asn

455 460 465

Thr Glu Cys Leu Cys Ile Asn Pro Gly Ser Phe Pro Arg Ser Gly

470 475 480

Phe Ser Phe Lys Val Phe Tyr Pro Ser Asn Lys Thr Val Glu Asp

485 490 495

Ser Lys Leu Gln Gly Phe

500 

<210> 15 
<211> 893 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7506185CD1 

<400> 15 

Met Ala Asn Glu Thr Gln Lys Val Gly Ala Ile His Phe Pro Phe

1 5 10 15

Pro Phe Thr Pro Tyr Ser Ile Gln Glu Asp Phe Met Ala Glu Leu

20 25 30

Tyr Arg Val Leu Glu Ala Gly Lys Ile Gly Ile Phe Glu Ser Pro

35 40 45

Thr Gly Thr Gly Lys Ser Leu Ser Leu Ile Cys Gly Ala Leu Ser

50 55 60

Trp Leu Arg Asp Phe Glu Gln Lys Lys Arg Glu Glu Glu Ala Arg

65 70 75

Leu Leu Glu Thr Gly Thr Gly Pro Leu His Asp Glu Lys Asp Glu

80 85 90

Ser Leu Cys Leu Ser Ser Ser Cys Glu Gly Ala Ala Gly Thr Pro

95 100 105

Arg Pro Ala Gly Glu Pro Ala Trp Val Thr Gln Phe Val Gln Lys

110 115 120

Lys Glu Glu Arg Asp Leu Val Asp Arg Leu Lys Ala Glu Gln Ala

125 130 135

Arg Arg Lys Gln Arg Glu Glu Arg Leu Gln Gln Leu Gln His Arg

140 145 150

Val Gln Leu Lys Tyr Ala Ala Lys Arg Leu Arg Gln Glu Glu Glu

155 160 165

Glu Arg Glu Asn Leu Leu Arg Leu Ser Arg Glu Met Leu Glu Thr

170 175 180

Gly Pro Glu Ala Glu Arg Leu Glu Gln Leu Glu Ser Gly Glu Glu

185 190 195

Glu Leu Val Leu Ala Glu Tyr Glu Ser Asp Glu Glu Lys Lys Val

200 205 210

Ala Ser Arg Val Asp Glu Asp Glu Asp Asp Leu Glu Glu Glu His

215 220 225

Ile Thr Lys Ile Tyr Tyr Cys Ser Arg Thr His Ser Gln Leu Ala

230 235 240

Gln Phe Val His Glu Val Lys Lys Ser Pro Phe Gly Lys Asp Val

245 250 255

Arg Leu Val Ser Leu Gly Ser Arg Gln Asn Leu Cys Val Asn Glu

260 265 270

Asp Val Lys Ser Leu Gly Ser Val Gln Leu Ile Asn Asp Arg Cys

275 280 285

Val Asp Met Gln Arg Ser Arg His Glu Lys Lys Lys Gly Ala Glu

290 295 300

Glu Glu Lys Pro Lys Arg Arg Arg Gln Glu Lys Gln Ala Ala Cys

305 310 315

Pro Phe Tyr Asn His Glu Gln Met Gly Leu Leu Arg Asp Glu Ala

320 325 330

Leu Ala Glu Val Lys Asp Met Glu Gln Leu Leu Ala Leu Gly Lys

335 340 345

Glu Ala Arg Ala Cys Pro Tyr Tyr Gly Ser Arg Leu Ala Ile Pro

350 355 360

Ala Ala Gln Leu Val Val Leu Pro Tyr Gln Met Leu Leu His Ala

365 370 375

Ala Thr Arg Gln Ala Ala Gly Ile Arg Leu Gln Asp Gln Val Val

380 385 390

Ile Ile Asp Glu Ala His Asn Leu Ile Asp Thr Ile Thr Gly Met

395 400 405

His Ser Val Glu Val Ser Gly Ser Gln Leu Cys Gln Ala His Ser

410 415 420

Gln Leu Leu Gln Tyr Val Glu Arg Tyr Gly Lys Arg Leu Lys Ala

425 430 435

Lys Asn Leu Met Tyr Leu Lys Gln Ile Leu Tyr Leu Leu Glu Lys

440 445 450

Phe Val Ala Val Leu Gly Gly Asn Ile Lys Gln Asn Pro Asn Thr

455 460 465

Gln Ser Leu Ser Gln Thr Gly Thr Glu Leu Lys Thr Ile Asn Asp

470 475 480

Phe Leu Phe Gln Ser Gln Ile Asp Asn Ile Asn Leu Phe Lys Val

485 490 495

Gln Arg Tyr Cys Glu Lys Ser Met Ile Ser Arg Lys Leu Phe Gly

500 505 510

Phe Thr Glu Arg Tyr Gly Ala Val Phe Ser Ser Arg Glu Gln Pro

515 520 525

Lys Leu Ala Gly Phe Gln Gln Phe Leu Gln Ser Leu Gln Pro Arg

530 535 540

Thr Thr Glu Ala Leu Ala Ala Pro Ala Asp Glu Ser Gln Ala Ser

545 550 555

Thr Leu Arg Pro Ala Ser Pro Leu Met His Ile Gln Gly Phe Leu

560 565 570

Ala Ala Leu Thr Thr Ala Asn Gln Asp Gly Arg Val Ile Leu Ser

575 580 585

Arg Gln Gly Ser Leu Ser Gln Ser Thr Leu Lys Phe Leu Leu Leu

590 595 600

Asn Pro Ala Val His Phe Ala Gln Val Val Lys Glu Cys Arg Ala

605 610 615

Val Val Ile Ala Gly Gly Thr Met Gln Pro Val Ser Asp Phe Arg

620 625 630

Gln Gln Leu Leu Ala Cys Ala Gly Val Glu Ala Glu Arg Val Val

635 640 645

Glu Phe Ser Cys Gly His Val Ile Pro Pro Asp Asn Ile Leu Pro

650 655 660

Leu Val Ile Cys Ser Gly Ile Ser Asn Gln Pro Leu Glu Phe Thr

665 670 675

Phe Gln Lys Arg Glu Leu Pro Gln Met Met Asp Glu Val Gly Arg

680 685 690

Ile Leu Cys Asn Leu Cys Gly Val Val Pro Gly Gly Val Val Cys

695 700 705

Phe Phe Pro Ser Tyr Glu Tyr Leu Arg Gln Val His Ala His Trp

710 715 720

Glu Lys Gly Gly Leu Leu Gly Arg Leu Ala Ala Arg Lys Lys Ile

725 730 735

Phe Gln Glu Pro Lys Ser Ala His Gln Val Glu Gln Val Leu Leu

740 745 750

Ala Tyr Ser Arg Cys Ile Gln Ala Cys Gly Gln Glu Arg Gly Gln

755 760 765

Val Thr Gly Ala Leu Leu Leu Ser Val Val Gly Gly Lys Met Ser

770 775 780

Glu Gly Ile Asn Phe Ser Asp Asn Leu Gly Arg Ser Ala Glu Leu

785 790 795

Gln Glu Lys Met Ala Tyr Leu Asp Gln Thr Leu Pro Arg Ala Pro

800 805 810

Gly Gln Ala Pro Pro Gly Lys Ala Leu Val Glu Asn Leu Cys Met

815 820 825

Lys Ala Val Asn Gln Ser Ile Gly Arg Ala Ile Arg His Gln Lys

830 835 840

Asp Phe Ala Ser Val Val Leu Leu Asp Gln Arg Tyr Ala Arg Pro

845 850 855

Pro Val Leu Ala Lys Leu Pro Ala Trp Ile Arg Ala Arg Val Glu

860 865 870

Val Lys Ala Thr Phe Gly Pro Ala Ile Ala Ala Val Gln Lys Phe

875 880 885 
His Arg Glu Lys Ser Ala Ser Ser 

890 

<210> 16 

<211> 555 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 8096611CD1

<400> 16

Ser Gln His Ser Ser His Glu Asp Thr Asn Lys Asn Ile Asn Ser

425 430 435

Glu Asp Ile Val Glu Met Val Ser Leu Ala Ala Glu Glu Ser Ser

440 445 450

Leu Met Asn Asp Leu Phe Ala Gln Glu Ile Glu Tyr Ser Pro Val

455 460 465

Glu Leu Glu Ser Leu Ser Asn Glu Glu Asn Ile Glu Thr Glu Arg

470 475 480

Trp Asn Gly Arg Ile Leu Gln Met Leu Asn Arg Leu Arg Glu Ser

485 490 495

Asn Lys Met Gly Met Gln Ser Phe Ser Leu Met Lys Leu Cys Arg

500 505 510

Asn Ser Asp Arg Lys Gln Ala Ala Ala Lys Phe Tyr Ser Phe Leu

515 520 525

Val Leu Lys Lys Gln Leu Ala Ile Glu Leu Ser Gln Ser Ala Pro

530 535 540

Tyr Ala Asp Ile Ile Ala Thr Met Gly Pro Met Phe Tyr Asn Ile

545 550 555



<210> 17 

<211> 584 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 8174603CD1

<400> 17

Met Ala Ser Ala Ser Gln Gly Ala Asp Asp Asp Gly Ser Arg Arg

1 5 10 15

Lys Pro Arg Leu Ala Ala Ser Leu Gln Ile Ser Pro Gln Pro Arg

20 25 30

Pro Trp Arg Pro Leu Pro Ala Gln Ala Gln Ser Ala Trp Glu Pro

35 40 45

Cys Ala Arg Cys Pro Arg Arg Ile Ala Ala Glu Gly Gly Trp Pro

50 55 60

Gln Val Ser Val Leu Arg Asp Ser Gly Pro Gly Ala Gly Ala Gly

65 70 75

Val Gly Glu Leu Gly Ala Ala Arg Ala Trp Glu Asn Leu Gly Glu

80 85 90

Gln Met Gly Lys Ala Pro Arg Val Pro Val Pro Pro Ala Gly Leu

95 100 105

Ser Leu Pro Leu Lys Asp Pro Pro Ala Ser Gln Ala Val Ser Leu

110 115 120

Leu Thr Glu Tyr Ala Ala Ser Leu Gly Ile Phe Leu Leu Phe Arg

125 130 135

Glu Asp Gln Pro Pro Gly Glu Ala Gly Pro Cys Phe Pro Phe Ser

140 145 150

Val Ser Ala Glu Leu Asp Gly Val Val Cys Pro Ala Gly Thr Ala

155 160 165

Asn Ser Lys Thr Glu Ala Lys Gln Gln Ala Thr Leu Ser Ala Leu

170 175 180 

Cys Tyr Ile Arg Ser Gln Leu Glu Asn Pro Val Gly Pro Leu Leu

185 190 195

Pro Ala Val Ser Arg Pro Gly Ala Glu Asn Ile Leu Thr His Glu

200 205 210

Gln Arg Cys Ala Ala Leu Val Ser Ala Gly Phe Asp Leu Leu Leu

215 220 225

Asp Glu Arg Ser Pro Tyr Trp Ala Cys Lys Gly Thr Val Ala Gly

230 235 240

Val Ile Leu Glu Arg Glu Ile Pro Arg Ala Arg Gly His Val Lys

245 250 255

Glu Ile Tyr Lys Leu Val Ala Leu Gly Thr Gly Ser Ser Cys Cys

260 265 270

Ala Gly Trp Leu Glu Phe Ser Gly Gln Gln Leu His Asp Cys His

275 280 285

Gly Leu Val Ile Ala Arg Arg Ala Leu Leu Arg Phe Leu Phe Arg

290 295 300

Gln Leu Leu Leu Ala Thr Gln Gly Gly Pro Lys Gly Lys Glu Gln

305 310 315

Ser Val Leu Ala Pro Gln Pro Gly Pro Gly Pro Pro Phe Thr Leu

320 325 330

Lys Pro Arg Val Phe Leu His Leu Tyr Ile Ser Asn Thr Pro Lys

335 340 345

Gly Ala Ala Arg Asp Ile Tyr Leu Pro Pro Thr Ser Glu Gly Gly

350 355 360

Leu Pro His Ser Pro Pro Met Arg Leu Gln Ala His Val Leu Gly

365 370 375

Gln Leu Lys Pro Val Cys Tyr Val Ala Pro Ser Leu Cys Asp Thr

380 385 390

His Val Gly Cys Leu Ser Ala Ser Asp Lys Leu Ala Arg Trp Ala

395 400 405

Val Leu Gly Leu Gly Gly Ala Leu Leu Ala His Leu Val Ser Pro

410 415 420

Leu Tyr Ser Thr Ser Leu Ile Leu Ala Asp Ser Cys His Asp Pro

425 430 435

Pro Thr Leu Ser Arg Ala Ile His Thr Arg Pro Cys Leu Asp Ser

440 445 450

Val Leu Gly Pro Cys Leu Pro Pro Pro Tyr Val Arg Thr Ala Leu

455 460 465

His Leu Phe Ala Gly Pro Pro Val Ala Pro Ser Glu Pro Thr Pro

470 475 480

Asp Thr Cys Arg Gly Leu Ser Leu Asn Trp Ser Leu Gly Asp Pro

485 490 495

Gly Ile Glu Val Val Asp Val Ala Thr Gly Arg Val Lys Ala Asn

500 505 510

Ala Ala Leu Gly Pro Pro Ser Arg Leu Cys Lys Ala Ser Phe Leu

515 520 525

Arg Ala Phe His Gln Ala Ala Arg Ala Val Gly Lys Pro Tyr Leu

530 535 540

Leu Ala Leu Lys Thr Tyr Glu Ala Ala Lys Ala Gly Pro Tyr Gln

545 550 555

Glu Ala Arg Arg Gln Leu Ser Leu Leu Leu Asp Gln Gln Gly Leu

560 565 570

Gly Ala Trp Pro Ser Lys Pro Leu Val Gly Lys Phe Arg Asn

575 580



<210> 18 
<211> 554 
<212> PRT 
<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 3101042CD1

<400> 18

Met Leu Glu Asn Phe Arg Asn Leu Met Leu Val Arg Asp Gly Ile

1 5 10 15

Lys Asn Asn Ile Leu Asn Leu Gln Ala Lys Gly Leu Ser Tyr Leu

20 25 30

Ser Gln Glu Val Leu His Cys Trp Gln Ile Trp Lys Gln Arg Ile

35 40 45

Arg Asp Leu Thr Val Ser Gln Asp Tyr Ile Val Asn Leu Gln Glu

50 55 60

Glu Cys Ser Pro His Leu Glu Asp Val Ser Leu Ser Glu Glu Trp

65 70 75

Ala Gly Ile Ser Leu Gln Ile Ser Glu Asn Glu Asn Tyr Val Val

80 85 90

Asn Ala Ile Ile Lys Asn Gln Asp Ile Thr Ala Trp Gln Ser Leu

95 100 105

Thr Gln Val Leu Thr Pro Glu Ser Trp Arg Lys Ala Asn Ile Met

110 115 120

Thr Glu Pro Gln Asn Ser Gln Gly Arg Tyr Lys Gly Ile Tyr Met

125 130 135

Glu Glu Lys Leu Tyr Arg Arg Ala Gln His Asp Asp Ser Leu Ser

140 145 150

Trp Thr Ser Cys Asp His His Glu Ser Gln Glu Cys Lys Gly Glu

155 160 165

Asp Pro Gly Arg His Pro Ser Cys Gly Lys Asn Leu Gly Met Lys

170 175 180

Ser Thr Val Glu Lys Arg Asn Ala Ala His Val Leu Pro Gln Pro

185 190 195

Phe Pro Cys Asn Asn Cys Gly Val Ala Phe Ala Asp Asp Thr Asp

200 205 210

Pro His Val His His Ser Thr His Leu Gly Glu Lys Ser Tyr Lys

215 220 225

Cys Asp Gln Tyr Gly Lys Asn Phe Ser Gln Ser Gln Asp Leu Ile

230 235 240

Val His Cys Lys Thr His Ser Gly Lys Thr Pro Tyr Glu Phe His

245 250 255

Glu Trp Pro Met Gly Cys Lys Gln Ser Ser Asp Leu Pro Arg Tyr

260 265 270

Gln Lys Val Ser Ser Gly Asp Lys Pro Tyr Lys Cys Lys Glu Cys

275 280 285

Gly Lys Gly Phe Arg Arg Ser Ser Ser Leu His Asn His His Arg

290 295 300

Val His Thr Gly Glu Met Pro Tyr Lys Cys Asp Glu Cys Gly Lys

305 310 315

Gly Phe Gly Phe Arg Ser Leu Leu Cys Ile His Gln Gly Val His

320 325 330

Thr Gly Lys Lys Pro Tyr Lys Cys Glu Glu Cys Gly Lys Gly Phe

335 340 345

Asp Gln Ser Ser Asn Leu Leu Val His Gln Arg Val His Thr Gly

350 355 360

Glu Lys Pro Tyr Lys Cys Ser Glu Cys Gly Lys Cys Phe Ser Ser

365 370 375



Ser Ser Val Leu Gln Val His Trp Arg Phe His Thr Gly Glu Lys

380 385 390

Pro Tyr Arg Cys Gly Glu Cys Gly Lys Gly Phe Ser Gln Cys Thr

395 400 405

His Leu His Ile His Gln Arg Val His Thr Gly Glu Lys Pro Tyr

410 415 420

Lys Cys Asn Val Cys Gly Lys Asp Phe Ala Tyr Ser Ser Val Leu

425 430 435

His Thr His Gln Arg Val His Thr Gly Glu Lys Pro Tyr Lys Cys

440 445 450

Glu Val Cys Gly Lys Cys Phe Ser Tyr Ser Ser Tyr Phe His Leu

455 460 465

His Gln Arg Asp His Ile Arg Glu Lys Pro Tyr Lys Cys Asp Glu

470 475 480

Cys Gly Lys Gly Phe Ser Arg Asn Ser Asp Leu Asn Val His Leu

485 490 495

Arg Val His Thr Arg Glu Arg Pro Tyr Lys Cys Lys Ala Cys Gly

500 505 510

Lys Gly Phe Ser Arg Asn Ser Tyr Leu Leu Ala His Gln Arg Val

515 520 525

His Ile Asp Glu Thr Gln Tyr Thr His Cys Glu Arg Gly Lys Asp

530 535 540

Leu Leu Thr His Gln Arg Leu His Glu Gln Arg Glu Thr Leu

545 550



<210> 19 

<211> 1004 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4972035CD1 

<400> 19 

Met Trp Glu Lys Met Glu Thr Lys Thr Ile Val Tyr Asp Leu Asp

1 5 10 15

Thr Ser Gly Gly Leu Met Glu Gln Ile Gln Ala Leu Leu Ala Pro

20 25 30

Pro Lys Thr Asp Glu Ala Glu Lys Arg Ser Arg Lys Pro Glu Lys

35 40 45

Glu Pro Arg Arg Ser Gly Arg Ala Thr Asn His Asp Ser Cys Asp

50 55 60

Ser Cys Lys Glu Gly Gly Asp Leu Leu Cys Cys Asp His Cys Pro

65 70 75

Ala Ala Phe His Leu Gln Cys Cys Asn Pro Pro Leu Ser Glu Glu

80 85 90

Met Leu Pro Pro Gly Glu Trp Met Cys His Arg Cys Thr Val Arg

95 100 105

Arg Lys Lys Arg Glu Gln Lys Lys Glu Leu Gly His Val Asn Gly

110 115 120

Leu Val Asp Lys Ser Gly Lys Arg Thr Thr Ser Pro Ser Ser Asp

125 130 135

Thr Asp Leu Leu Asp Arg Ser Ala Ser Lys Thr Glu Leu Lys Ala

140 145 150

Ile Ala His Ala Arg Ile Leu Glu Arg Arg Ala Ser Arg Pro Gly

155 160 165 

Thr Pro Thr Ser Ser Ala Ser Thr Glu Thr Pro Thr Ser Glu Gln

170 175 180

Asn Asp Val Asp Glu Asp Ile Ile Asp Val Asp Glu Glu Pro Val

185 190 195

Ala Ala Glu Pro Asp Tyr Val Gln Pro Gln Leu Arg Arg Pro Phe

200 205 210

Glu Leu Leu Ile Ala Ala Ala Met Glu Arg Asn Pro Thr Gln Phe

215 220 225

Gln Leu Pro Asn Glu Leu Thr Cys Thr Thr Ala Leu Pro Gly Ser

230 235 240

Ser Lys Arg Arg Arg Lys Glu Glu Thr Thr Gly Lys Asn Val Lys

245 250 255

Lys Thr Gln His Glu Leu Asp His Asn Gly Leu Val Pro Leu Pro

260 265 270

Val Lys Val Cys Phe Thr Cys Asn Arg Ser Cys Arg Val Ala Pro

275 280 285

Leu Ile Gln Cys Asp Tyr Cys Pro Leu Leu Phe His Met Asp Cys

290 295 300

Leu Glu Pro Pro Leu Thr Ala Met Pro Leu Gly Arg Trp Met Cys

305 310 315

Pro Asn His Ile Glu His Val Val Leu Asn Gln Lys Asn Met Thr

320 325 330

Leu Ser Asn Arg Cys Gln Val Phe Asp Arg Phe Gln Asp Thr Val

335 340 345

Ser Gln His Val Val Lys Val Asp Phe Leu Asn Arg Ile His Lys

350 355 360

Lys His Pro Pro Asn Arg Arg Val Leu Gln Ser Val Lys Arg Arg

365 370 375

Ser Leu Lys Val Pro Asp Ala Ile Lys Ser Gln Tyr Gln Phe Pro

380 385 390

Pro Pro Leu Ile Ala Pro Ala Ala Ile Arg Asp Gly Glu Leu Ile

395 400 405

Cys Asn Gly Ile Pro Glu Glu Ser Gln Met His Leu Leu Asn Ser

410 415 420

Glu His Leu Ala Thr Gln Ala Glu Gln Gln Glu Trp Leu Cys Ser

425 430 435

Val Val Ala Leu Gln Cys Ser Ile Leu Lys His Leu Ser Ala Lys

440 445 450

Gln Met Pro Ser His Trp Asp Ser Glu Gln Thr Glu Lys Ala Asp

455 460 465

Ile Lys Pro Val Ile Val Thr Asp Ser Ser Val Thr Thr Ser Leu

470 475 480

Gln Thr Ala Asp Lys Thr Pro Thr Pro Ser His Tyr Pro Leu Ser

485 490 495

Cys Pro Ser Gly Ile Ser Thr Gln Asn Ser Leu Ser Cys Ser Pro

500 505 510

Pro His Gln Ser Pro Ala Leu Glu Asp Ile Gly Cys Ser Ser Cys

515 520 525

Ala Glu Lys Ser Lys Lys Thr Pro Cys Gly Thr Ala Asn Gly Pro

530 535 540

Val Asn Thr Glu Val Lys Ala Asn Gly Pro His Leu Tyr Ser Ser

545 550 555

Pro Thr Asp Ser Thr Asp Pro Arg Arg Leu Pro Gly Ala Asn Thr

560 565 570

Pro Leu Pro Gly Leu Ser His Arg Gln Gly Trp Pro Arg Pro Leu

575 580 585

Thr Pro Pro Ala Ala Gly Gly Leu Gln Asn His Thr Val Gly Ile

590 595 600

Ile Val Lys Thr Glu Asn Ala Thr Gly Pro Ser Ser Cys Pro Gln

605 610 615

Arg Ser Leu Val Pro Val Pro Ser Leu Pro Pro Ser Ile Pro Ser

620 625 630

Ser Cys Ala Ser Ile Glu Asn Thr Ser Thr Leu Gln Arg Lys Thr

635 640 645

Val Gln Ser Gln Ile Gly Pro Pro Leu Thr Asp Ser Arg Pro Leu

650 655 660

Gly Ser Pro Pro Asn Ala Thr Arg Val Leu Thr Pro Pro Gln Ala

665 670 675

Ala Gly Asp Gly Ile Leu Ala Thr Thr Ala Asn Gln Arg Phe Ser

680 685 690

Ser Pro Ala Pro Ser Ser Asp Gly Lys Val Ser Pro Gly Thr Leu

695 700 705

Ser Ile Gly Ser Ala Leu Thr Val Pro Ser Phe Pro Ala Asn Ser

710 715 720

Thr Ala Met Val Asp Leu Thr Asn Ser Leu Arg Ala Phe Met Asp

725 730 735

Val Asn Gly Glu Ile Glu Ile Asn Met Leu Asp Glu Lys Leu Ile

740 745 750

Lys Phe Leu Ala Leu Gln Arg Ile His Gln Leu Phe Pro Ser Arg

755 760 765

Val Gln Pro Ser Pro Gly Ser Val Gly Thr His Gln Leu Ala Ser

770 775 780

Gly Gly His His Ile Glu Val Gln Arg Lys Glu Val Gln Ala Arg

785 790 795

Ala Val Phe Tyr Pro Leu Leu Gly Leu Gly Gly Ala Val Asn Met

800 805 810

Cys Tyr Arg Thr Leu Tyr Ile Gly Thr Gly Ala Asp Met Asp Val

815 820 825

Cys Leu Thr Asn Tyr Gly His Cys Asn Tyr Val Ser Gly Lys His

830 835 840

Ala Cys Ile Phe Tyr Asp Glu Asn Thr Lys His Tyr Glu Leu Leu

845 850 855

Asn Tyr Ser Glu His Gly Thr Thr Val Asp Asn Val Leu Tyr Ser

860 865 870

Cys Asp Phe Ser Glu Lys Thr Pro Pro Thr Pro Pro Ser Ser Ile

875 880 885

Val Ala Lys Val Gln Ser Val Ile Arg Arg Arg Arg His Gln Lys

890 895 900

Gln Asp Glu Glu Pro Ser Glu Glu Ala Ala Met Met Ser Ser Gln

905 910 915

Ala Gln Gly Pro Gln Arg Arg Pro Cys Asn Cys Lys Ala Ser Ser

920 925 930

Ser Ser Leu Ile Gly Gly Ser Gly Ala Gly Trp Glu Gly Thr Ala

935 940 945

Leu Leu His His Gly Ser Tyr Ile Lys Leu Gly Cys Leu Gln Phe

950 955 960

Val Phe Ser Ile Thr Glu Phe Ala Thr Lys Gln Pro Lys Gly Asp

965 970 975

Ala Ser Leu Leu Gln Asp Gly Val Leu Ala Glu Lys Leu Ser Leu

980 985 990

Lys Pro His Gln Gly Pro Val Leu Arg Ser Asn Ser Val Pro
995 1000 

<210> 20 

<211> 123 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506265CD1 

<400> 20 

Met Ser Leu Val Ile Pro Glu Lys Phe Gln His Ile Leu Arg Val

1 5 10 15

Leu Asn Thr Asn Ile Asp Gly Arg Arg Lys Ile Ala Phe Ala Ile

20 25 30

Thr Ala Ile Lys Val Glu Arg Val Ile Thr Ile Met Gln Asn Pro

35 40 45

Arg Gln Tyr Lys Ile Pro Asp Trp Phe Leu Asn Arg Gln Lys Asp

50 55 60

Val Lys Asp Gly Lys Tyr Ser Gln Val Leu Ala Asn Gly Leu Asp

65 70 75

Asn Lys Leu Arg Glu Asp Leu Glu Arg Leu Lys Lys Ile Arg Ala

80 85 90

His Arg Gly Leu Arg His Phe Trp Gly Leu Arg Val Arg Gly Gln

95 100 105 

His Thr Lys Thr Thr Gly Arg Arg Gly Arg Thr Val Gly Val Ser 
110 115 120 

Lys Lys Lys 



<210> 21 

<211> 112 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506304CD1 

<400> 21 

Met Ser Ala His Leu Gln Trp Met Val Val Arg Asn Cys Ser Ser

1 5 10 15

Phe Leu Ile Lys Arg Asn Lys Gln Thr Tyr Ser Thr Glu Pro Asn

20 25 30

Asn Leu Lys Ala Arg Asn Ser Phe Arg Tyr Asn Gly Leu Ile His

35 40 45

Arg Lys Thr Ala Thr Ser Tyr Val Arg Thr Thr Ile Asn Lys Asn

50 55 60

Ala Arg Ala Thr Leu Ser Ser Ile Arg His Met Ile Arg Lys Asn

65 70 75

Lys Tyr Arg Pro Asp Leu Arg Met Ala Ala Ile Arg Arg Ala Ser

80 85 90

Ala Ile Leu Arg Ser Gln Lys Pro Val Met Val Lys Arg Lys Arg

95 100 105

Thr Arg Pro Thr Lys Ser Ser
110 



<210> 22 

<211> 987 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506198CD1 



<400> 22 
Met Asn His Asp Phe Gln Ala Leu Ala Leu Glu Ser Arg Gly Met

1 5 10 15

Gly Glu Leu Leu Pro Thr Lys Lys Phe Trp Glu Pro Asp Asp Ser

20 25 30

Thr Lys Asp Gly Gln Lys Gly Ile Phe Leu Gly Asp Asp Glu Trp

35 40 45

Arg Glu Thr Ala Trp Gly Ala Ser His His Ser Met Ser Gln Pro

50 55 60

Ile Met Val Gln Arg Arg Ser Gly Gln Gly Phe His Gly Asn Ser

65 70 75

Glu Val Asn Ala Ile Leu Ser Pro Arg Ser Glu Ser Gly Gly Leu

80 85 90

Gly Val Ser Met Val Glu Tyr Val Leu Ser Ser Ser Pro Ala Asp

95 100 105

Lys Leu Asp Ser Arg Phe Arg Lys Gly Asn Phe Gly Thr Arg Asp

110 115 120

Ala Glu Thr Asp Gly Pro Glu Lys Gly Asp Gln Lys Gly Lys Ala

125 130 135

Ser Pro Phe Glu Glu Asp Gln Asn Arg Asp Leu Lys Gln Gly Asp

140 145 150

Asp Asp Asp Ser Lys Ile Asn Gly Arg Gly Leu Pro Asn Gly Met

155 160 165

Asp Ala Asp Cys Lys Asp Phe Asn Arg Thr Pro Gly Ser Arg Gln

170 175 180

Ala Ser Pro Thr Glu Val Val Glu Arg Leu Gly Pro Asn Thr Asn

185 190 195

Pro Ser Glu Gly Leu Gly Pro Leu Pro Asn Pro Thr Ala Asn Lys

200 205 210

Pro Leu Val Glu Glu Phe Ser Asn Pro Glu Thr Gln Asn Leu Asp

215 220 225

Ala Met Glu Gln Val Gly Leu Glu Ser Leu Gln Phe Asp Tyr Pro

230 235 240

Gly Asn Gln Val Pro Met Asp Ser Ser Gly Ala Thr Val Gly Leu

245 250 255

Phe Asp Tyr Asn Ser Gln Gln Gln Leu Phe Gln Arg Thr Asn Ala

260 265 270

Leu Thr Val Gln Gln Leu Thr Ala Ala Gln Gln Gln Gln Tyr Ala

275 280 285

Leu Ala Ala Ala Gln Gln Pro His Ile Ala Gly Val Phe Ser Ala

290 295 300

Gly Leu Ala Pro Ala Ala Phe Val Pro Asn Pro Tyr Ile Ile Ser

305 310 315

Ala Ala Pro Pro Gly Thr Asp Pro Tyr Thr Ala Ala Gly Leu Ala

320 325 330

Ala Ala Ala Thr Leu Ala Gly Pro Ala Val Val Pro Pro Gln Tyr

335 340 345

Tyr Gly Val Pro Trp Gly Val Tyr Pro Ala Asn Leu Phe Gln Gln

350 355 360

Gln Ala Ala Ala Ala Ala Asn Asn Thr Ala Ser Gln Gln Ala Ala

365 370 375

Ser Gln Ala Gln Pro Gly Gln Gln Gln Val Leu Arg Ala Gly Ala

380 385 390

Gly Gln Arg Pro Leu Thr Pro Asn Gln Gly Gln Gln Gly Gln Gln

395 400 405

Ala Glu Ser Leu Ala Ala Ala Ala Ala Ala Asn Pro Thr Leu Ala

410 415 420

Phe Gly Gln Gly Leu Ala Thr Gly Met Pro Gly Tyr Gln Val Leu

425 430 435

Ala Pro Thr Ala Tyr Tyr Asp Gln Thr Gly Ala Leu Val Val Gly

440 445 450

Pro Gly Ala Arg Thr Gly Leu Gly Ala Pro Val Arg Leu Met Ala

455 460 465

Pro Thr Pro Val Leu Ile Ser Ser Ala Ala Ala Gln Ala Ala Ala

470 475 480

Ala Ala Ala Ala Gly Gly Thr Ala Ser Ser Leu Thr Gly Ser Thr

485 490 495

Asn Gly Leu Phe Arg Pro Ile Gly Thr Gln Pro Pro Gln Gln Gln

500 505 510

Gln Gln Gln Pro Ser Thr Asn Leu Gln Ser Asn Ser Phe Tyr Gly

515 520 525

Ser Ser Ser Leu Thr Asn Ser Ser Gln Ser Ser Ser Leu Phe Ser

530 535 540

His Gly Pro Gly Gln Pro Gly Ser Thr Ser Leu Gly Phe Gly Ser

545 550 555

Gly Asn Ser Leu Gly Ala Ala Ile Gly Ser Ala Leu Ser Gly Phe

560 565 570

Gly Ser Ser Gly Gly Leu Thr Asn Gly Ser Gly Arg Tyr Ile Ser

575 580 585

Ala Ala Pro Gly Ala Glu Ala Lys Tyr Arg Ser Ala Ser Ser Thr

590 595 600

Ser Ser Leu Phe Ser Ser Ser Ser Gln Leu Phe Pro Pro Ser Arg

605 610 615

Leu Arg Tyr Asn Arg Ser Asp Ile Met Pro Ser Gly Arg Ser Arg

620 625 630

Leu Leu Glu Asp Phe Arg Asn Asn Arg Phe Pro Asn Leu Gln Leu

635 640 645

Arg Asp Leu Ile Gly His Ile Val Glu Phe Ser Gln Asp Gln His

650 655 660

Gly Ser Arg Phe Ile Gln Gln Lys Leu Glu Arg Ala Thr Pro Ala

665 670 675

Glu Arg Gln Met Val Phe Asn Glu Ile Leu Gln Ala Ala Tyr Gln

680 685 690

Leu Met Thr Asp Val Phe Gly Asn Tyr Val Ile Gln Lys Phe Phe

695 700 705

Glu Phe Gly Ser Leu Asp Gln Lys Leu Ala Leu Ala Thr Arg Ile

710 715 720

Arg Gly His Val Leu Pro Leu Ala Leu Gln Met Tyr Gly Cys Arg

725 730 735

Val Ile Gln Lys Ala Leu Glu Ser Ile Ser Ser Asp Gln Gln Val

740 745 750



Ile Ser Glu Met Val Lys Glu Leu Asp Gly His Val Leu Lys Cys

755 760 765

Val Lys Asp Gln Asn Gly Asn His Val Val Gln Lys Cys Ile Glu

770 775 780

Cys Val Gln Pro Gln Ser Leu Gln Phe Ile Ile Asp Ala Phe Lys

785 790 795

Gly Gln Val Phe Val Leu Ser Thr His Pro Tyr Gly Cys Arg Val

800 805 810

Ile Gln Arg Ile Leu Glu His Cys Thr Ala Glu Gln Thr Leu Pro

815 820 825

Ile Leu Glu Glu Leu His Gln His Thr Glu Gln Leu Val Gln Asp

830 835 840

Gln Tyr Gly Asn Tyr Val Ile Gln His Val Leu Glu His Gly Arg

845 850 855

Pro Glu Asp Lys Ser Lys Ile Val Ser Glu Ile Arg Gly Lys Val

860 865 870

Leu Ala Leu Ser Gln His Lys Phe Ala Ser Asn Val Val Glu Lys

875 880 885

Cys Val Thr His Ala Ser Arg Ala Glu Arg Ala Leu Leu Ile Asp

890 895 900

Glu Val Cys Cys Gln Asn Asp Gly Pro His Ser Ala Leu Tyr Thr

905 910 915

Met Met Lys Asp Gln Tyr Ala Asn Tyr Val Val Gln Lys Met Ile

920 925 930

Asp Met Ala Glu Pro Ala Gln Arg Lys Ile Ile Met His Lys Ile

935 940 945

Arg Pro His Ile Thr Thr Leu Arg Lys Tyr Thr Tyr Gly Lys His

950 955 960

Ile Leu Ala Lys Leu Glu Lys Tyr Tyr Leu Lys Asn Ser Pro Asp

965 970 975

Leu Gly Pro Ile Gly Gly Pro Pro Asn Gly Met Leu

980 985



<210> 23

<211> 1013

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1381261CD1

<400> 23

Met Pro Pro Pro Ala Glu Val Thr Asp Pro Ser His Ala Pro Ala
1 5 10 15

Val Leu Arg Gln Leu Asn Glu Gln Arg Leu Arg Gly Leu Phe Cys

20 25 30

Asp Val Thr Leu Ile Ala Gly Asp Thr Lys Phe Pro Ala His Arg

35 40 45

Ser Val Leu Ala Ala Ser Ser Pro Phe Phe Arg Glu Ala Leu Leu

50 55 60

Thr Ser Ala Pro Leu Pro Leu Pro Pro Ala Thr Gly Gly Ala Ala

65 70 75

Pro Asn Pro Ala Thr Thr Thr Ala Ala Ser Ser Ser Ser Ser Ser

80 85 90

Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ala Ser Ser Ser Ser

95 100 105

Ser Ser Ser Ser Ser Ser Pro Pro Pro Ala Ser Pro Pro Ala Ser

110 115 120

Ser Pro Pro Arg Val Leu Glu Leu Pro Gly Val Pro Ala Ala Ala

125 130 135

Phe Ser Asp Val Leu Asn Phe Ile Tyr Ser Ala Arg Leu Ala Leu

140 145 150

Pro Gly Gly Gly Gly Asp Gly Ala Ala Val Ala Glu Ile Gly Ala

155 160 165

Leu Gly Arg Arg Leu Gly Ile Ser Arg Leu Gln Gly Leu Gly Glu

170 175 180

Gly Gly Asp Ala Trp Val Pro Pro Thr Pro Ala Pro Met Ala Thr

185 190 195

Ser Gln Pro Glu Glu Asp Ser Phe Gly Pro Gly Pro Arg Pro Ala

200 205 210

Gly Glu Trp Glu Gly Asp Arg Ala Glu Ala Gln Ala Pro Asp Leu

215 220 225

Gln Cys Ser Leu Pro Arg Arg Pro Leu Pro Cys Pro Gln Cys Gly

230 235 240

Lys Ser Phe Ile His Pro Lys Arg Leu Gln Thr His Glu Ala Gln

245 250 255

Cys Arg Arg Gly Ala Ser Thr Arg Gly Ser Thr Gly Leu Gly Ala

260 265 270

Gly Gly Ala Gly Pro Gly Gly Pro Ala Gly Val Asp Ala Ser Ala

275 280 285

Leu Pro Pro Pro Val Gly Phe Arg Gly Gly Pro Glu His Val Val

290 295 300

Lys Val Val Gly Gly His Val Leu Tyr Val Cys Ala Ala Cys Glu

305 310 315

Arg Ser Tyr Val Thr Leu Ser Ser Leu Lys Arg His Ser Asn Val

320 325 330

His Ser Trp Arg Arg Lys Tyr Pro Cys Arg Tyr Cys Glu Lys Val

335 340 345

Phe Ala Leu Ala Glu Tyr Arg Thr Lys His Glu Val Trp His Thr

350 355 360

Gly Glu Arg Arg Tyr Gln Cys Ile Phe Cys Trp Glu Thr Phe Val

365 370 375

Thr Tyr Tyr Asn Leu Lys Thr His Gln Arg Ala Phe His Gly Ile

380 385 390

Ser Pro Gly Leu Leu Ala Ser Glu Lys Thr Pro Asn Gly Gly Tyr

395 400 405

Lys Pro Lys Leu Asn Thr Leu Lys Leu Tyr Arg Leu Leu Pro Met

410 415 420

Arg Ala Ala Lys Arg Pro Tyr Lys Thr Tyr Ser Gln Gly Ala Pro

425 430 435

Glu Ala Pro Leu Ser Pro Thr Leu Asn Thr Pro Ala Pro Val Ala

440 445 450

Met Pro Ala Ser Pro Pro Pro Gly Pro Pro Pro Ala Pro Glu Pro

455 460 465

Gly Pro Pro Pro Ser Val Ile Thr Phe Ala His Pro Ala Pro Ser

470 475 480

Val Ile Val His Gly Gly Ser Ser Ser Gly Gly Gly Gly Ser Gly

485 490 495

Thr Ala Ser Thr Gly Gly Ser Gln Ala Ala Ser Val Ile Thr Tyr

500 505 510

Thr Ala Pro Pro Arg Pro Pro Lys Lys Arg Glu Tyr Pro Pro Pro

515 520 525

Pro Pro Glu Pro Ala Ala Thr Pro Thr Ser Pro Ala Thr Ala Val

530 535 540

Ser Pro Ala Thr Ala Ala Gly Pro Ala Met Ala Thr Thr Thr Glu

545 550 555

Glu Ala Lys Gly Arg Asn Pro Arg Ala Gly Arg Thr Leu Thr Tyr

560 565 570

Thr Ala Lys Pro Val Gly Gly Ile Gly Gly Gly Gly Gly Pro Pro

575 580 585

Thr Gly Ala Gly Arg Gly Pro Ser Gln Leu Gln Ala Pro Pro Pro

590 595 600

Leu Cys Gln Ile Thr Val Arg Ile Gly Glu Glu Ala Ile Val Lys

605 610 615

Arg Arg Ile Ser Glu Thr Asp Leu Arg Pro Gly Glu Leu Ser Gly

620 625 630

Glu Glu Met Glu Glu Ser Glu Glu Asp Glu Glu Glu Glu Asp Glu

635 640 645

Glu Glu Glu Glu Glu Asp Glu Glu Glu Ser Lys Ala Gly Gly Glu

650 655 660

Asp Gln Leu Trp Arg Pro Tyr Tyr Ser Tyr Lys Pro Lys Arg Lys

665 670 675

Ala Gly Ala Ala Gly Gly Ala Ser Val Gly Gly Ser Gly Leu Pro

680 685 690

Arg Gly Arg Arg Pro Pro Arg Trp Arg Gln Lys Leu Glu Arg Arg

695 700 705

Ser Trp Glu Glu Thr Pro Ala Ala Glu Ser Pro Ala Gly Arg Ala

710 715 720

Arg Thr Glu Arg Arg His Arg Cys Gly Asp Cys Ala Gln Thr Phe

725 730 735

Thr Thr Leu Arg Lys Leu Arg Lys His Gln Glu Ala His Gly Gly

740 745 750

Gly Ser His Ser Ser Arg Ala Gly Arg Arg Pro Ser Thr Arg Phe

755 760 765

Thr Cys Pro His Cys Ala Lys Val Cys Lys Thr Ala Ala Ala Leu

770 775 780

Ser Arg His Gly Gln Arg His Ala Ala Glu Arg Pro Gly Gly Thr

785 790 795

Pro Thr Pro Val Ile Ala Tyr Ser Lys Gly Ser Ala Gly Thr Arg

800 805 810

Pro Gly Asp Val Lys Glu Glu Ala Pro Gln Glu Met Gln Val Ser

815 820 825

Ser Ser Ser Gly Glu Ala Gly Gly Gly Ser Thr Ala Ala Glu Glu

830 835 840

Ala Ser Glu Thr Ala Ser Leu Gln Asp Pro Ile Ile Ser Gly Gly

845 850 855

Glu Glu Pro Pro Val Val Ala Ser Gly Gly Ser Tyr Val Tyr Pro

860 865 870

Pro Val Gln Glu Phe Pro Leu Ala Leu Ile Gly Gly Gly Arg Glu

875 880 885

Pro Gly Gly Gly Arg Gly Lys Ser Gly Ser Glu Gly Pro Val Gly

890 895 900

Ala Gly Glu Gly Asp Arg Met Glu Gly Ile Gly Ala Ala Lys Val

905 910 915

Thr Phe Tyr Pro Glu Pro Tyr Pro Leu Val Tyr Gly Pro Gln Leu

920 925 930

Leu Ala Ala Tyr Pro Tyr Asn Phe Ser Asn Leu Ala Ala Leu Pro

935 940 945

Val Ala Leu Asn Met Val Leu Pro Asp Glu Lys Gly Ala Gly Ala

950 955 960

Leu Pro Phe Leu Pro Gly Val Phe Gly Tyr Ala Val Asn Pro Gln

965 970 975

Ala Ala Pro Pro Ala Pro Pro Thr Pro Pro Pro Pro Thr Leu Pro

980 985 990

Pro Pro Ile Pro Pro Lys Gly Glu Gly Glu Arg Ala Gly Val Glu

995 1000 1005

Arg Thr Gln Lys Gly Asp Val Gly
1010

<210> 24 
<211> 141 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 6803876CD1 

<400> 24 

Met Ala Glu Ser Asp Trp Asp Thr Val Thr Val Leu Arg Lys Lys
1 5 10 15

Gly Pro Thr Ala Ala Gln Ala Lys Ser Lys Gln Ala Ile Leu Ala

20 25 30

Ala Gln Arg Arg Gly Glu Asp Val Glu Thr Ser Lys Lys Trp Ala

35 40 45

Ala Gly Gln Asn Lys Gln His Ser Ile Thr Lys Asn Thr Ala Lys

50 55 60

Leu Asp Arg Glu Thr Glu Glu Leu His His Asp Arg Val Thr Leu

65 70 75

Glu Val Gly Lys Val Ile Gln Gln Gly Arg Gln Ser Lys Gly Leu

80 85 90

Thr Gln Lys Asp Leu Ala Thr Lys Ile Asn Glu Lys Pro Gln Val

95 100 105

Ile Ala Asp Tyr Glu Ser Gly Arg Ala Ile Pro Asn Asn Gln Val
110 115 120

Leu Gly Lys Ile Glu Arg Ala Ile Asp Val Gly Thr Arg Ser Ala
125 130 135

Arg Val Leu Arg Ala Gln
140

<210> 25 
<211> 334 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7506281CD1 

<400> 25 

Met Asp Ala Asp Glu Gly Gln Asp Met Ser Gln Val Ser Gly Lys
1 5 10 15

Glu Ser Pro Pro Val Ser Asp Thr Pro Asp Glu Gly Asp Glu Pro
20 25 30

Met Pro Ile Pro Glu Asp Leu Ser Thr Thr Ser Gly Gly Gln Gln
35 40 45

Ser Ser Lys Ser Asp Arg Val Val Val Ile Lys Glu Glu Thr Asn
50 55 60

His Ser Glu Met Ala Glu Asp Leu Cys Lys Ile Gly Ser Glu Arg
65 70 75

Ser Leu Val Leu Asp Arg Leu Ala Ser Asn Val Ala Lys Arg Lys
80 85 90

Ser Ser Met Pro Gln Lys Phe Leu Gly Asp Lys Gly Leu Ser Asp
95 100 105

Thr Pro Tyr Asp Ser Ser Ala Ser Tyr Glu Lys Glu Asn Glu Met
110 115 120

Met Lys Ser His Val Met Asp Gln Ala Ile Asn Asn Ala Ile Asn
125 130 135

Tyr Leu Gly Ala Glu Ser Leu Arg Pro Leu Val Gln Thr Pro Pro
140 145 150

Gly Gly Ser Glu Val Val Pro Val Ile Ser Pro Met Tyr Gln Leu
155 160 165

His Lys Pro Leu Ala Glu Gly Thr Pro Arg Ser Asn His Ser Ala
170 175 180

Gln Asp Ser Ala Val Glu Asn Leu Leu Leu Leu Ser Lys Ala Lys
185 190 195

Leu Val Pro Ser Glu Arg Glu Ala Ser Pro Ser Asn Ser Cys Gln
200 205 210

Asp Ser Thr Asp Thr Glu Ser Asn Asn Glu Glu Gln Arg Ser Gly
215 220 225

Leu Ile Tyr Leu Thr Asn His Ile Ala Pro His Ala Arg Asn Gly
230 235 240

Leu Ser Leu Lys Glu Glu His Arg Ala Tyr Asp Leu Leu Arg Ala
245 250 255

Ala Ser Glu Asn Ser Gln Asp Ala Leu Arg Val Val Ser Thr Ser
260 265 270

Gly Glu Gln Met Lys Val Tyr Lys Cys Glu His Cys Arg Val Leu
275 280 285

Phe Leu Asp His Val Met Tyr Thr Ile His Met Gly Cys His Gly
290 295 300

Phe Arg Asp Pro Phe Glu Cys Asn Met Cys Gly Tyr His Ser Gln
305 310 315

Asp Arg Tyr Glu Phe Ser Ser His Ile Thr Arg Gly Glu His Arg
320 325 330

Phe His Met Ser



<210> 26 
<211> 439 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7506175CD1 

<400> 26 

Met Asp Thr Ser Asp Leu Phe Ala Ser Cys Arg Lys Gly Asp Val
1 5 10 15

Gly Arg Val Arg Tyr Leu Leu Glu Gln Arg Asp Val Glu Val Asn
20 25 30

Val Arg Asp Lys Trp Asp Ser Thr Pro Leu Tyr Tyr Ala Cys Leu
35 40 45

Cys Gly His Glu Glu Leu Val Leu Tyr Leu Leu Ala Asn Gly Ala
50 55 60

Arg Cys Glu Ala Asn Thr Phe Asp Gly Glu Arg Cys Leu Tyr Gly
65 70 75

Ala Leu Ser Asp Pro Ile Arg Arg Ala Leu Arg Asp Tyr Lys Gln
80 85 90

Val Thr Ala Ser Cys Arg Arg Arg Asp Tyr Tyr Asp Asp Phe Leu
95 100 105

Gln Arg Leu Leu Glu Gln Gly Ile His Ser Asp Val Val Phe Val

110 115 120

Val His Gly Lys Pro Phe Arg Val His Arg Cys Val Leu Gly Ala

125 130 135

Arg Ser Ala Tyr Phe Ala Asn Met Leu Asp Thr Lys Trp Lys Gly

140 145 150

Lys Ser Val Val Val Leu Arg His Pro Leu Ile Asn Pro Val Ala

155 160 165

Phe Gly Ala Leu Leu Gln Tyr Leu Tyr Thr Val Ala Ser Lys Pro

170 175 180

Gly Thr Cys Val Lys Val Leu Thr Ile Glu Pro Pro Pro Ala Asp

185 190 195

Pro Arg Leu Arg Glu Asp Met Ala Leu Leu Ala Asp Cys Ala Leu

200 205 210

Pro Pro Glu Leu Arg Gly Asp Leu Trp Glu Leu Pro Phe Pro Cys

215 220 225

Pro Asp Gly Phe Asn Ser Cys Pro Asp Ile Cys Phe Arg Val Ala

230 235 240

Gly Cys Ser Phe Leu Cys His Lys Ala Phe Phe Cys Gly Arg Ser

245 250 255

Asp Tyr Phe Arg Ala Leu Leu Asp Asp His Phe Arg Glu Ser Glu

260 265 270

Glu Pro Ala Thr Ser Gly Gly Pro Pro Ala Val Thr Leu His Gly

275 280 285

Ile Ser Pro Asp Val Phe Thr His Val Leu Tyr Tyr Met Tyr Ser

290 295 300

Asp His Thr Glu Leu Ser Pro Glu Ala Ala Tyr Asp Val Leu Ser

305 310 315

Val Ala Asp Met Tyr Leu Leu Pro Gly Leu Lys Arg Leu Cys Gly

320 325 330

Arg Ser Leu Ala Gln Met Leu Asp Glu Asp Thr Val Val Gly Val

335 340 345

Trp Arg Val Ala Lys Leu Phe Arg Leu Ala Arg Leu Glu Asp Gln

350 355 360

Cys Thr Glu Tyr Met Ala Lys Val Ile Glu Lys Leu Val Glu Arg

365 370 375

Glu Asp Phe Val Glu Ala Val Lys Glu Glu Ala Ala Ala Val Ala

380 385 390

Ala Arg Gln Glu Thr Asp Ser Ile Pro Leu Val Asp Asp Ile Arg

395 400 405

Phe His Val Ala Ser Thr Val Gln Thr Tyr Ser Ala Ile Glu Glu

410 415 420

Ala Gln Gln Arg Leu Arg Ala Leu Glu Asp Leu Leu Val Ser Ile

425 430 435

Gly Leu Asp Cys



<210> 27 

<211> 448 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506303CD1 

<400> 27 

Met Glu Thr Ile Leu Glu Gln Gln Arg Arg Tyr His Glu Glu Lys
1 5 10 15

Glu Arg Leu Met Asp Val Met Ala Lys Glu Met Leu Thr Lys Lys
20 25 30

Ser Thr Leu Arg Asp Gln Ile Asn Ser Asp His Arg Thr Arg Ala
35 40 45

Met Gln Asp Ile Cys Val Pro Met Ser Val Glu Phe Glu Glu Leu
50 55 60

Leu Lys Ala Arg Glu Asn Pro Ser Glu Glu Ala Gln Asn Leu Val
65 70 75

Glu Phe Thr Asp Glu Glu Gly Tyr Gly Arg Tyr Leu Asp Leu His
80 85 90

Asp Cys Tyr Leu Lys Tyr Ile Asn Leu Lys Ala Ser Glu Lys Leu
95 100 105

Asp Tyr Ile Thr Tyr Leu Ser Ile Phe Asp Gln Leu Phe Asp Ile

110 115 120

Pro Lys Glu Arg Lys Asn Ala Glu Tyr Lys Arg Tyr Leu Glu Met

125 130 135

Leu Leu Glu Tyr Leu Gln Asp Tyr Thr Asp Arg Val Lys Pro Leu

140 145 150

Gln Asp Gln Asn Glu Leu Phe Gly Lys Ile Gln Ala Glu Phe Glu

155 160 165

Lys Lys Trp Glu Asn Gly Thr Phe Pro Gly Trp Pro Lys Glu Thr

170 175 180

Ser Ser Ala Leu Thr His Ala Gly Ala His Leu Asp Leu Ser Ala

185 190 195

Phe Ser Ser Trp Glu Glu Leu Ala Ser Leu Gly Leu Asp Arg Leu

200 205 210

Lys Ser Ala Leu Leu Ala Leu Gly Leu Lys Cys Gly Gly Thr Leu

215 220 225

Glu Glu Arg Ala Gln Arg Leu Phe Ser Thr Lys Gly Lys Ser Leu

230 235 240

Glu Ser Leu Asp Thr Ser Leu Phe Ala Lys Asn Pro Lys Ser Lys

245 250 255

Gly Thr Lys Arg Asp Thr Glu Arg Asn Lys Asp Ile Ala Phe Leu

260 265 270

Glu Ala Gln Ile Tyr Glu Tyr Val Glu Ile Leu Gly Glu Gln Arg

275 280 285

His Leu Thr His Glu Asn Val Gln Arg Lys Gln Ala Arg Thr Gly

290 295 300

Glu Glu Arg Glu Glu Glu Glu Glu Glu Gln Ile Ser Glu Ser Glu

305 310 315

Ser Glu Asp Glu Glu Asn Glu Ile Ile Tyr Asn Pro Lys Asn Leu

320 325 330

Pro Leu Gly Trp Asp Gly Lys Pro Ile Pro Tyr Trp Leu Tyr Lys

335 340 345

Leu His Gly Leu Asn Ile Asn Tyr Asn Cys Glu Ile Cys Gly Asn

350 355 360

Tyr Thr Tyr Arg Gly Pro Lys Ala Phe Gln Arg His Phe Ala Glu

365 370 375

Trp Arg His Ala His Gly Met Arg Cys Leu Gly Ile Pro Asn Thr

380 385 390

Ala His Phe Ala Asn Val Thr Gln Ile Glu Asp Ala Val Ser Leu

395 400 405

Trp Ala Lys Leu Lys Leu Gln Lys Ala Ser Glu Arg Trp Gln Pro

410 415 420

Asp Thr Glu Glu Glu Tyr Glu Asp Ser Ser Gly Asn Val Val Asn

425 430 435

Lys Lys Thr Tyr Glu Asp Leu Lys Arg Gln Gly Leu Leu

440 445



<210> 28 

<211> 104 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7353336CD1 



<400> 28 
Met Pro Lys Arg Lys Ala Glu Gly Asp Ala Lys Gly Asp Lys Ala
1 5 10 15

Lys Val Lys Asp Glu Pro Gln Arg Arg Ser Ala Arg Leu Ser Ala
20 25 30

Lys Pro Ala Pro Pro Lys Pro Glu Pro Lys Pro Lys Lys Ala Pro
35 40 45

Ala Lys Lys Gly Glu Lys Val Pro Lys Gly Lys Lys Gly Lys Ala
50 55 60

Asp Ala Gly Lys Glu Gly Asn Asn Pro Ala Glu Asn Gly Asp Ala
65 70 75

Lys Thr Asp Gln Ala Gln Lys Ala Glu Gly Ala Gly Asp Ala Lys
80 85 90

Thr Asp Gln Ala Gln Lys Ala Glu Gly Ala Gly Asp Ala Lys
95 100



<210> 29 

<211> 769 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 3001652CD1 



<400> 29 

Met Gly Pro Pro Leu Ala Pro Arg Pro Ala His Val Pro Gly Glu
1 5 10 15

Ala Gly Pro Arg Arg Thr Arg Glu Ser Arg Pro Gly Ala Val Ser
20 25 30

Phe Ala Asp Val Ala Val Tyr Phe Ser Pro Glu Glu Trp Glu Cys
35 40 45

Leu Arg Pro Ala Gln Arg Ala Leu Tyr Arg Asp Val Met Arg Glu
50 55 60

Thr Phe Gly His Leu Gly Ala Leu Gly Phe Ser Val Pro Lys Pro
65 70 75

Ala Phe Ile Ser Trp Val Glu Gly Glu Val Glu Ala Trp Ser Pro
80 85 90

Glu Ala Gln Asp Pro Asp Gly Glu Ser Ser Ala Ala Phe Ser Arg
95 100 105

Gly Gln Gly Gln Glu Ala Gly Ser Arg Asp Gly Asn Glu Glu Lys

110 115 120

Glu Arg Leu Lys Lys Cys Pro Lys Gln Lys Glu Val Ala His Glu

125 130 135

Val Ala Val Lys Glu Trp Trp Pro Ser Val Ala Cys Pro Glu Phe

140 145 150

Cys Asn Pro Arg Gln Ser Pro Met Asn Pro Trp Leu Lys Asp Thr

155 160 165

Leu Thr Arg Arg Leu Pro His Ser Cys Pro Asp Cys Gly Arg Asn

170 175 180

Phe Ser Tyr Pro Ser Leu Leu Ala Ser His Gln Arg Val His Ser

185 190 195

Gly Glu Arg Pro Phe Ser Cys Gly Gln Cys Gln Ala Arg Phe Ser

200 205 210

Gln Arg Arg Tyr Leu Leu Gln His Gln Phe Ile His Thr Gly Glu

215 220 225

Lys Pro Tyr Pro Cys Pro Asp Cys Gly Arg Arg Phe Arg Gln Arg

230 235 240

Gly Ser Leu Ala Ile His Arg Arg Ala His Thr Gly Glu Lys Pro

245 250 255

Tyr Ala Cys Ser Asp Cys Lys Ser Arg Phe Thr Tyr Pro Tyr Leu

260 265 270

Leu Ala Ile His Gln Arg Lys His Thr Gly Glu Lys Pro Tyr Ser

275 280 285

Cys Pro Asp Cys Ser Leu Arg Phe Ala Tyr Thr Ser Leu Leu Ala

290 295 300

Ile His Arg Arg Ile His Thr Gly Glu Lys Pro Tyr Pro Cys Pro

305 310 315

Asp Cys Gly Arg Arg Phe Thr Tyr Ser Ser Leu Leu Leu Ser His

320 325 330

Arg Arg Ile His Ser Asp Ser Arg Pro Phe Pro Cys Val Glu Cys

335 340 345

Gly Lys Gly Phe Lys Arg Lys Thr Ala Leu Glu Ala His Arg Trp

350 355 360

Ile His Arg Ser Cys Ser Glu Arg Arg Ala Trp Gln Gln Ala Val

365 370 375

Val Gly Arg Ser Glu Pro Ile Pro Val Leu Gly Gly Lys Asp Pro

380 385 390

Pro Val His Phe Arg His Phe Pro Asp Ile Phe Gln Glu Phe Cys

395 400 405

Gln Gln Arg Leu Gln Asp Arg Gly Val Pro Ser Asn Ala Pro Pro

410 415 420

Val Pro Gly Gln Ser Pro Arg Ser Phe Phe Arg Asp Arg Arg Gln

425 430 435

Ser Ser Ala Val Ala Tyr Cys Gly Tyr Arg Gly Val Ser Glu Ala

440 445 450

Ser Gly Pro Tyr Ile Phe Leu Glu Gly Lys Lys Pro Leu Leu Tyr

455 460 465

Phe Pro Asp Thr Pro Pro Pro Pro Leu Glu Lys Ala Ala Glu Ala

470 475 480

Ala Leu Phe Lys Gly Lys Trp Asp Asp Glu Ala Arg Glu Met Ala

485 490 495

Pro Pro Pro Ala Pro Leu Leu Ala Pro Arg Pro Gly Glu Thr Arg

500 505 510

Pro Gly Cys Arg Lys Pro Gly Thr Val Ser Phe Ala Asp Val Ala

515 520 525

Val Tyr Phe Ser Pro Glu Glu Trp Gly Cys Leu Arg Pro Ala Gln

530 535 540

Arg Ala Leu Tyr Arg Asp Val Met Gln Glu Thr Tyr Gly His Leu

545 550 555

Gly Ala Leu Gly Phe Pro Gly Pro Lys Pro Ala Leu Ile Ser Trp

560 565 570

Met Glu Gln Glu Ser Glu Ala Trp Ser Pro Ala Ala Gln Asp Pro

575 580 585

Glu Lys Gly Glu Arg Leu Gly Gly Ala Arg Arg Gly Asp Val Pro

590 595 600

Asn Arg Lys Glu Glu Glu Pro Glu Glu Val Pro Arg Ala Lys Gly

605 610 615

Pro Arg Lys Ala Pro Val Lys Glu Ser Pro Glu Val Leu Val Glu

620 625 630

Arg Asn Pro Asp Pro Ala Ile Ser Val Ala Pro Ala Arg Ala Gln

635 640 645

Pro Pro Lys Asn Ala Ala Trp Asp Pro Thr Thr Gly Ala Gln Pro

650 655 660

Pro Ala Pro Ile Pro Ser Met Asp Ala Gln Ala Gly Gln Arg Arg

665 670 675

His Val Cys Thr Asp Cys Gly Arg Arg Phe Thr Tyr Pro Ser Leu

680 685 690

Leu Val Ser His Arg Arg Met His Ser Gly Glu Arg Pro Phe Pro

695 700 705

Cys Pro Glu Cys Gly Met Arg Phe Lys Arg Lys Phe Ala Val Glu

710 715 720

Ala His Gln Trp Ile His Arg Ser Cys Ser Gly Gly Arg Arg Gly

725 730 735

Arg Arg Pro Gly Ile Arg Ala Val Pro Arg Ala Pro Val Arg Gly

740 745 750

Asp Arg Asp Pro Pro Val Leu Phe Arg His Tyr Pro Asp Ile Phe

755 760 765

Glu Glu Cys Gly



<210> 30

<211> 1081

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 1689128CD1

<400> 30

Met Pro Arg Arg Lys Gln Glu Gln Pro Lys Arg Leu Pro Ser His
1 5 10 15

Val Ser Arg Gln Glu Glu Ala Glu Gly Glu Leu Ser Glu Gly Glu
20 25 30

His Trp Tyr Gly Asn Ser Ser Glu Thr Pro Ser Glu Ala Ser Tyr
35 40 45

Gly Glu Val Gln Glu Asn Tyr Lys Leu Ser Leu Glu Asp Arg Ile
50 55 60

Gln Glu Gln Ser Thr Ser Pro Asp Thr Ser Leu Gly Ser Thr Thr
65 70 75

Pro Ser Ser His Thr Leu Glu Leu Val Ala Leu Asp Ser Glu Val
80 85 90

Leu Arg Asp Ser Leu Gln Cys Gln Asp His Leu Ser Pro Gly Val
95 100 105

Ser Ser Leu Cys Asp Asp Asp Pro Gly Ser Asn Lys Pro Leu Ser

110 115 120

Ser Asn Leu Arg Arg Leu Leu Glu Ala Gly Ser Leu Lys Leu Asp

125 130 135

Ala Ala Ala Thr Ala Asn Gly Arg Val Glu Ser Pro Val Asn Val

140 145 150

Gly Ser Asn Leu Ser Phe Ser Pro Pro Ser His His Ala Gln Gln

155 160 165

Leu Ser Val Leu Ala Arg Lys Leu Ala Glu Lys Gln Glu Gln Asn

170 175 180

Asp Gln Tyr Thr Pro Ser Asn Arg Phe Ile Trp Asn Gln Gly Lys

185 190 195

Trp Leu Pro Asn Ser Thr Thr Thr Cys Ser Leu Ser Pro Asp Ser

200 205 210

Ala Ile Leu Lys Leu Lys Ala Ala Ala Asn Ala Val Leu Gln Asp

215 220 225

Lys Ser Leu Thr Arg Thr Glu Glu Thr Met Arg Phe Glu Ser Phe

230 235 240

Ser Ser Pro Phe Ser Ser Gln Ser Ala Ser Ser Thr Leu Ala Ala

245 250 255

Leu Ser Lys Lys Val Ser Glu Arg Ser Leu Thr Pro Gly Gln Glu

260 265 270

His Pro Pro Pro Ala Ser Ser Phe Leu Ser Leu Ala Ser Met Thr

275 280 285

Ser Ser Ala Ala Leu Leu Lys Glu Val Ala Ala Arg Ala Ala Gly

290 295 300

Ser Leu Leu Ala Glu Lys Ser Ser Leu Leu Pro Glu Asp Pro Leu

305 310 315

Pro Pro Pro Pro Ser Glu Lys Lys Pro Glu Lys Val Thr Pro Pro

320 325 330

Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Gln Ser

335 340 345

Leu Glu Leu Leu Leu Leu Pro Val Pro Lys Gly Arg Val Ser Lys

350 355 360

Pro Ser Asn Ser Ala Ser Glu Glu Glu Ser Gly Lys Pro Phe Gln

365 370 375

Cys Pro Ile Cys Gly Leu Val Ile Lys Arg Lys Ser Tyr Trp Lys

380 385 390

Arg His Met Val Ile His Thr Gly Leu Lys Ser His Gln Cys Pro

395 400 405

Leu Cys Pro Phe Arg Cys Ala Arg Lys Asp Asn Leu Lys Ser His

410 415 420

Met Lys Val His Gln His Gln Asp Arg Gly Glu Thr Phe Gln Cys

425 430 435

Gln Leu Cys Pro Phe Thr Ser Ser Arg His Phe Ser Leu Lys Leu

440 445 450

His Met Arg Cys His Gln His Phe Leu Arg Thr Glu Ala Lys Val

455 460 465

Lys Glu Glu Ile Pro Asp Pro Asp Val Lys Gly Ser Pro His Leu

470 475 480

Ser Asp Ser Ala Cys Leu Gly Gln Gln Arg Glu Gly Gly Gly Thr

485 490 495

Glu Leu Val Gly Thr Met Met Thr Ser Asn Thr Pro Glu Arg Thr

500 505 510

Ser Gln Gly Gly Ala Gly Val Ser Pro Leu Leu Val Lys Glu Glu

515 520 525

Pro Lys Glu Asp Asn Gly Leu Pro Thr Ser Phe Thr Leu Asn Ala

530 535 540

Ala Asp Arg Pro Ala Asn His Thr Lys Leu Lys Asp Pro Ser Glu

545 550 555

Tyr Val Ala Asn Ser Ala Ser Ala Leu Phe Ser Gln Asp Ile Ser

560 565 570

Val Lys Met Ala Ser Asp Phe Leu Met Lys Leu Ser Ala Ala Asn

575 580 585

Gln Lys Glu Pro Met Asn Leu Asn Phe Lys Val Lys Glu Glu Pro

590 595 600

Lys Glu Gly Glu Ser Leu Ser Thr Thr Leu Pro Arg Ser Ser Tyr

605 610 615

Val Phe Ser Pro Glu Ser Glu Val Ser Ala Pro Gly Val Ser Glu

620 625 630

Asp Ala Leu Lys Pro Gln Glu Gly Lys Gly Ser Val Leu Arg Arg

635 640 645

Asp Val Ser Val Lys Ala Ala Ser Glu Leu Leu Met Lys Leu Ser

650 655 660

Ala Glu Ser Tyr Lys Glu Thr Gln Met Val Lys Ile Lys Glu Glu

665 670 675

Pro Met Glu Val Asp Ile Gln Asp Ser His Val Ser Ile Ser Pro

680 685 690

Ser Arg Asn Val Gly Tyr Ser Thr Leu Ile Gly Arg Glu Lys Thr

695 700 705

Glu Pro Leu Gln Lys Met Pro Glu Gly Arg Val Pro Pro Glu Arg

710 715 720

Asn Leu Phe Ser Gln Asp Ile Ser Val Lys Met Ala Ser Glu Leu

725 730 735

Leu Phe Gln Leu Ser Glu Lys Val Ser Lys Glu His Asn His Thr

740 745 750

Lys Glu Asn Thr Ile Arg Thr Thr Thr Ser Pro Phe Phe Ser Glu

755 760 765

Asp Thr Phe Arg Gln Ser Pro Phe Thr Ser Asn Ser Lys Glu Leu

770 775 780

Leu Pro Ser Asp Ser Val Leu His Gly Arg Ile Ser Ala Pro Glu

785 790 795

Thr Glu Lys Ile Val Leu Glu Ala Gly Asn Gly Leu Pro Ser Trp

800 805 810

Lys Phe Asn Asp Gln Leu Phe Pro Cys Asp Val Cys Gly Lys Val

815 820 825

Phe Gly Arg Gln Gln Thr Leu Ser Arg His Leu Ser Leu His Thr

830 835 840



Glu Glu Arg Lys Tyr Lys Cys His Leu Cys Pro Tyr Ala Ala Lys

845 850 855

Cys Arg Ala Asn Leu Asn Gln His Leu Thr Val His Ser Val Lys

860 865 870

Leu Val Ser Thr Asp Thr Glu Asp Ile Val Ser Ala Val Thr Ser

875 880 885

Glu Gly Ser Asp Gly Lys Lys His Pro Tyr Tyr Tyr Ser Cys His

890 895 900

Val Cys Gly Phe Glu Thr Glu Leu Asn Val Gln Phe Val Ser His

905 910 915

Met Ser Leu His Val Asp Lys Glu Gln Trp Met Phe Ser Ile Cys

920 925 930

Cys Thr Ala Cys Asp Phe Val Thr Met Glu Glu Ala Glu Ile Lys

935 940 945

Thr His Ile Gly Thr Lys His Thr Gly Glu Asp Arg Lys Thr Pro

950 955 960

Ser Glu Ser Asn Ser Pro Ser Ser Ser Ser Leu Ser Ala Leu Ser

965 970 975

Asp Ser Ala Asn Ser Lys Asp Asp Ser Asp Gly Ser Gln Lys Asn

980 985 990

Lys Gly Gly Asn Asn Leu Leu Val Ile Ser Val Met Pro Gly Ser

995 1000 1005

Gln Pro Ser Leu Asn Ser Glu Glu Lys Pro Glu Lys Gly Phe Glu
1010 1015 1020

Cys Val Phe Cys Asn Phe Val Cys Lys Thr Lys Asn Met Phe Glu
1025 1030 1035

Arg His Leu Gln Ile His Leu Ile Thr Arg Met Phe Glu Cys Asp
1040 1045 1050

Val Cys His Lys Phe Met Lys Thr Pro Glu Gln Leu Leu Glu His
1055 1060 1065

Lys Lys Cys His Thr Val Pro Thr Gly Gly Leu Asn Ser Gly Gln
1070 1075 1080

Trp



<210> 31 
<211> 1007 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2362969CD1 

<400> 31 

Met Glu Leu Asp Leu Asn Ser Ser Ser Glu Asp Asn Lys Pro Gly
1 5 10 15

Lys Arg Val Arg Thr Asn Ser Arg Ser Thr Pro Thr Thr Pro Gln

20 25 30

Gly Lys Pro Glu Thr Thr Phe Leu Asp Gln Gly Cys Ser Ser Pro

35 40 45

Val Leu Ile Asp Cys Pro His Pro Asn Cys Asn Lys Lys Tyr Lys

50 55 60

His Ile Asn Gly Leu Arg Tyr His Gln Ala His Ala His Leu Asp

65 70 75

Pro Glu Asn Lys Leu Glu Phe Glu Pro Asp Ser Glu Asp Lys Ile

80 85 90

Ser Asp Cys Glu Glu Gly Leu Ser Asn Val Ala Leu Glu Cys Ser

95 100 105

Glu Pro Ser Thr Ser Val Ser Ala Tyr Asp Gln Leu Lys Ala Pro

110 115 120

Ala Ser Pro Gly Ala Gly Asn Pro Pro Gly Thr Pro Lys Gly Lys

125 130 135

Arg Glu Leu Met Ser Asn Gly Pro Gly Ser Ile Ile Gly Ala Lys

140 145 150

Ala Gly Lys Asn Ser Gly Lys Lys Lys Gly Leu Asn Asn Glu Leu

155 160 165

Asn Asn Leu Pro Val Ile Ser Asn Met Thr Ala Ala Leu Asp Ser

170 175 180

Cys Ser Ala Ala Asp Gly Ser Leu Ala Ala Glu Met Pro Lys Leu

185 190 195

Glu Ala Glu Gly Leu Ile Asp Lys Lys Asn Leu Gly Asp Lys Glu

200 205 210

Lys Gly Lys Lys Ala Thr Asn Cys Lys Thr Asp Lys Asn Leu Ser

215 220 225

Lys Leu Lys Ser Ala Arg Pro Ile Ala Pro Ala Pro Ala Pro Thr

230 235 240

Pro Pro Gln Leu Ile Ala Ile Pro Thr Ala Thr Phe Thr Thr Thr

245 250 255

Thr Thr Gly Thr Ile Pro Gly Leu Pro Ser Leu Thr Thr Thr Val

260 265 270

Val Gln Ala Thr Pro Lys Ser Pro Pro Leu Lys Pro Ile Gln Pro

275 280 285

Lys Pro Thr Ile Met Gly Glu Pro Ile Thr Val Asn Pro Ala Leu

290 295 300

Val Ser Leu Lys Asp Lys Lys Lys Lys Glu Lys Arg Lys Leu Lys

305 310 315

Asp Lys Glu Gly Lys Glu Thr Gly Ser Pro Lys Met Asp Ala Lys

320 325 330

Leu Gly Lys Leu Glu Asp Ser Lys Gly Ala Ser Lys Asp Leu Pro

335 340 345

Gly His Phe Leu Lys Asp His Leu Asn Lys Asn Glu Gly Leu Ala

350 355 360

Asn Gly Leu Ser Glu Ser Gln Glu Ser Arg Met Ala Ser Ile Lys

365 370 375

Ala Glu Ala Asp Lys Val Tyr Thr Phe Thr Asp Asn Ala Pro Ser

380 385 390

Pro Ser Ile Gly Ser Ala Ser Arg Leu Glu Cys Ser Thr Leu Val

395 400 405

Asn Gly Gln Ala Pro Met Ala Pro Leu His Val Leu Thr Gln Asn

410 415 420

Gly Ala Glu Ser Ser Ala Ala Lys Thr Ser Ser Pro Ala Tyr Ser

425 430 435

Asp Ile Ser Asp Ala Ala Asp Asp Gly Gly Ser Asp Ser Arg Ser

440 445 450

Glu Gly Met Arg Ser Lys Ala Ser Ser Pro Ser Asp Ile Ile Ser

455 460 465

Ser Lys Asp Ser Val Val Lys Gly His Ser Ser Thr Thr Ala Gln

470 475 480

Ser Ser Gln Leu Lys Glu Ser His Ser Pro Tyr Tyr His Ser Tyr

485 490 495

Asp Pro Tyr Tyr Ser Pro Ser Tyr Met His Pro Gly Gln Val Gly

500 505 510

Ala Pro Ala Ala Gly Asn Ser Gly Ser Thr Gln Gly Met Lys Ile

515 520 525

Lys Lys Glu Ser Glu Glu Asp Ala Glu Lys Lys Asp Lys Ala Glu

530 535 540

Gln Leu Asp Ser Lys Lys Val Asp His Asn Ser Ala Ser Leu Gln

545 550 555

Pro Gln His Gln Ser Val Ile Thr Gln Arg His Pro Ala Leu Ala

560 565 570

Gln Ser Leu Tyr Tyr Gly Gln Tyr Ala Tyr Gly Leu Tyr Met Asp

575 580 585

Gln Lys Ser Leu Met Ala Thr Ser Pro Ala Tyr Arg Gln Gln Tyr

590 595 600

Glu Lys Tyr Tyr Glu Asp Gln Arg Leu Ala Glu Gln Lys Met Ala

605 610 615

Gln Thr Gly Arg Gly Asp Cys Glu Arg Lys Ser Glu Leu Pro Leu

620 625 630

Lys Glu Leu Gly Lys Glu Glu Thr Lys Gln Lys Asn Met Pro Ser

635 640 645

Ala Thr Ile Ser Lys Ala Pro Ser Thr Pro Glu Pro Asn Lys Asn

650 655 660

His Ser Lys Leu Gly Pro Ser Val Pro Asn Lys Thr Glu Glu Thr

665 670 675

Gly Lys Ser Gln Leu Leu Ser Asn His Gln Gln Gln Leu Gln Ala

680 685 690

Asp Ser Phe Lys Ala Lys Gln Met Glu Asn His Gln Leu Ile Lys

695 700 705

Glu Ala Val Glu Met Lys Ser Val Met Asp Ser Met Lys Gln Thr

710 715 720

Gly Val Asp Pro Thr Ser Arg Phe Lys Gln Asp Pro Asp Ser Arg

725 730 735

Thr Trp His His Tyr Val Tyr Gln Pro Lys Tyr Leu Asp Gln Gln

740 745 750

Lys Ser Glu Glu Leu Asp Arg Glu Lys Lys Leu Lys Glu Asp Ser

755 760 765

Pro Arg Lys Thr Pro Asn Lys Glu Ser Gly Val Pro Ser Leu Pro

770 775 780

Val Ser Leu Thr Ser Ile Lys Glu Glu Pro Lys Glu Ala Lys His

785 790 795

Pro Asp Ser Gln Ser Met Glu Glu Ser Lys Leu Lys Asn Asp Asp

800 805 810

Arg Lys Thr Pro Val Asn Trp Lys Asp Ser Arg Gly Thr Arg Val

815 820 825

Ala Val Ser Ser Pro Met Ser Gln His Gln Ser Tyr Ile Gln Tyr

830 835 840

Leu His Ala Tyr Pro Tyr Pro Gln Met Tyr Asp Pro Ser His Pro

845 850 855

Ala Tyr Arg Ala Val Ser Pro Val Leu Met His Ser Tyr Pro Gly

860 865 870

Ala Tyr Leu Ser Pro Gly Phe His Tyr Pro Val Tyr Gly Lys Met

875 880 885

Ser Gly Arg Glu Glu Thr Glu Lys Val Asn Thr Ser Pro Ser Val

890 895 900

Asn Thr Lys Thr Thr Thr Glu Ser Lys Ala Leu Asp Leu Leu Gln

905 910 915






Gln His Ala Asn Gln Tyr Arg Ser Lys Ser Pro Ala Pro Val Glu
920 925 930

Lys Ala Thr Ala Glu Arg Glu Arg Glu Ala Glu Arg Glu Arg Asp
935 940 945

Arg His Ser Pro Phe Gly Gln Arg His Leu His Thr His His His
950 955 960

Thr His Val Gly Met Gly Tyr Pro Leu Ile Pro Gly Gln Tyr Asp
965 970 975

Pro Phe Gln Gly Leu Thr Ser Ala Ala Leu Val Ala Ser Gln Gln
980 985 990

Val Ala Ala Gln Ala Ser Ala Ser Gly Met Phe Pro Gly Gln Arg

995 1000 1005 

Arg Glu 



<210> 32 

<211> 511 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 4753527CD1 

<400> 32 

Met Ala Ala Ala Ile Thr Arg His Gly Arg Pro Gly Gly Ala Leu
1 5 10 15

Pro Pro Glu Pro Ser Ala Pro Arg Gln Pro Gly Phe Gly Gly Arg
20 25 30

Gly Arg Ala Glu Pro Pro Glu Glu Glu Glu Glu Glu Glu Glu Glu 
35 40 45 

Glu Glu Glu Ala Glu Ala Glu Ala Val Ala Ala Leu Leu Leu Asn 
50 55 60 

Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Val Gly 
65 70 75 

Gly Gly Glu Ala Glu Thr Met Ser Glu Pro Ser Pro Glu Ser Ala 
80 85 90 

Ser Gln Ala Gly Glu Asp Glu Asp Glu Glu Glu Asp Asp Glu Glu
95 100 105 

Glu Glu Asp Glu Ser Ser Ser Ser Gly Gly Gly Glu Glu Glu Ser 
110 115 120 

Ser Ala Glu Ser Leu Val Gly Ser Ser Gly Gly Ser Ser Ser Asp 
125 130 135 

Glu Thr Arg Ser Leu Ser Pro Gly Ala Ala Ser Ser Ser Ser Gly 
140 145 150 

Asp Gly Asp Gly Lys Glu Gly Leu Glu Glu Pro Lys Gly Pro Arg 
155 160 165 

Gly Ser Gln Gly Gly Gly Gly Gly Gly Ser Ser Ser Ser Ser Val
170 175 180 

Val Ser Ser Gly Gly Asp Glu Gly Tyr Gly Thr Gly Gly Gly Gly 
185 190 195 

Ser Ser Ala Thr Ser Gly Gly Arg Arg Gly Ser Leu Glu Met Ser 
200 205 210 

Ser Asp Gly Glu Pro Leu Ser Arg Met Asp Ser Glu Asp Ser Ile
215 220 225

Ser Ser Thr Ile Met Asp Val Asp Ser Thr Ile Ser Ser Gly Arg
230 235 240

Ser Thr Pro Ala Met Met Asn Gly Gln Gly Ser Thr Thr Ser Ser
245 250 255

Ser Lys Asn Ile Ala Tyr Asn Cys Cys Trp Asp Gln Cys Gln Ala
260 265 270

Cys Phe Asn Ser Ser Pro Asp Leu Ala Asp His Ile Arg Ser Ile
275 280 285

His Val Asp Gly Gln Arg Gly Gly Val Phe Val Cys Leu Trp Lys
290 295 300

Gly Cys Lys Val Tyr Asn Thr Pro Ser Thr Ser Gln Ser Trp Leu
305 310 315

Gln Arg His Met Leu Thr His Ser Gly Asp Lys Pro Phe Lys Cys
320 325 330

Val Val Gly Gly Cys Asn Ala Ser Phe Ala Ser Gln Gly Gly Leu
335 340 345

Ala Arg His Val Pro Thr His Phe Ser Gln Gln Asn Ser Ser Lys
350 355 360

Val Ser Ser Gln Pro Lys Ala Lys Glu Glu Ser Pro Ser Lys Ala
365 370 375

Gly Met Asn Lys Arg Arg Lys Leu Lys Asn Lys Arg Arg Arg Ser
380 385 390

Leu Pro Arg Pro His Asp Phe Phe Asp Ala Gln Thr Leu Asp Ala
395 400 405

Ile Arg His Arg Ala Ile Cys Phe Asn Leu Ser Ala His Ile Glu
410 415 420

Ser Leu Gly Lys Gly His Ser Val Val Phe His Ser Thr Val Ile
425 430 435

Ala Lys Arg Lys Glu Asp Ser Gly Lys Ile Lys Leu Leu Leu His
440 445 450

Trp Met Pro Glu Asp Ile Leu Pro Asp Val Trp Val Asn Glu Ser
455 460 465

Glu Arg His Gln Leu Lys Thr Lys Val Val His Leu Ser Lys Leu
470 475 480

Pro Lys Asp Thr Ala Leu Leu Leu Asp Pro Asn Ile Tyr Arg Thr
485 490 495

Met Pro Gln Lys Arg Leu Lys Arg Phe Asp Ile Leu Asn Phe Pro
500 505 510

Arg 



<210> 33 

<211> 485 

<212> PRT

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6928688CD1 

<400> 33 

Met Lys Asn Ala Thr Ile Val Met Ser Val Arg Arg Glu Gln Gly
1 5 10 15

Ser Ser Ser Gly Glu Gly Ser Leu Ser Phe Glu Asp Val Ala Val
20 25 30

Gly Phe Thr Arg Glu Glu Trp Gln Phe Leu Asp Gln Ser Gln Lys
35 40 45

Val Leu Tyr Lys Glu Val Met Leu Glu Asn Tyr Ile Asn Leu Val
50 55 60

Ser Ile Gly Tyr Arg Gly Thr Lys Pro Asp Ser Leu Phe Lys Leu
65 70 75

Glu Gln Gly Glu Pro Pro Gly Ile Ala Glu Gly Ala Ala His Ser
80 85 90

Gln Ile Cys Pro Gly Phe Val Ile Gln Ser Arg Arg Tyr Ala Gly
95 100 105

Lys Asp Ser Asp Ala Phe Gly Gly Tyr Gly Arg Ser Cys Leu His
110 115 120

Ile Lys Arg Asp Lys Thr Leu Thr Gly Val Lys Tyr His Arg Cys
125 130 135

Val Lys Pro Ser Ser Pro Lys Ser Gln Leu Asn Asp Leu Gln Lys
140 145 150

Ile Cys Ala Gly Gly Lys Pro His Glu Cys Ser Val Cys Gly Arg
155 160 165

Ala Phe Ser Arg Lys Ala Gln Leu Ile Gln His Gln Arg Thr Glu
170 175 180

Arg Gly Glu Lys Pro His Gly Cys Gly Glu Cys Gly Lys Thr Phe
185 190 195

Met Arg Lys Ile Gln Leu Thr Glu His Gln Arg Thr His Thr Gly
200 205 210

Glu Lys Pro His Glu Cys Ser Glu Cys Gly Lys Ala Phe Ser Arg
215 220 225

Lys Ser Gln Leu Met Val His Gln Arg Thr His Thr Gly Glu Lys
230 235 240

Pro Tyr Arg Cys Ser Glu Cys Gly Lys Ala Phe Ser Arg Lys Cys 

245 250 255 

Arg Leu Asn Arg His Gln Arg Ser His Thr Gly Glu Lys Leu Tyr
260 265 270

Gly Cys Ser Val Cys Gly Lys Ala Phe Ser Gln Lys Ala Tyr Leu
275 280 285

Thr Ala His Gln Arg Leu His Thr Gly Asp Lys Pro Tyr Lys Cys
290 295 300

Ser Asp Cys Gly Arg Thr Phe Tyr Phe Lys Ser Asp Leu Thr Arg
305 310 315

His Gln Arg Ile His Thr Gly Glu Lys Pro Tyr Glu Cys Ser Glu
320 325 330

Cys Glu Lys Ala Phe Arg Ser Lys Ser Lys Leu Ile Gln His Gln
335 340 345

Arg Thr His Thr Gly Glu Arg Pro Tyr Ser Cys Arg Glu Cys Gly
350 355 360

Lys Ala Phe Ala His Met Ser Val Leu Ile Lys His Glu Lys Thr
365 370 375

His Ile Arg Glu Thr Ala Ile Asn Ser Leu Thr Val Glu Lys Pro
380 385 390

Ser Ser Arg Ser His Thr Ser Leu Tyr Met Ser Glu Leu Ile Gln
395 400 405

Glu Gln Lys Thr Val Asn Thr Val Pro Ile Glu Met Pro Ser Ser
410 415 420

Gly Thr Pro Pro Leu Leu Asn Lys Ser Glu Arg Leu Val Gly Arg
425 430 435

Asn Val Val Ile Val Glu Gln Pro Phe Pro Arg Asn Gln Ala Phe
440 445 450

Val Val Asn Gln Glu Phe Glu Gln Arg Ile Ser Leu Thr Asn Glu
455 460 465

Val Asn Val Ala Pro Ser Val Ile Asn Tyr Ile Leu Tyr Leu Thr
470 475 480

Asp Ile Val Ser Glu
485



<210> 34 
<211> 1011 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7506388CD1 



<400> 34 

Met Val Ser Ser Asp Arg Pro Val Ser Leu Glu Asp Glu Val Ser
1 5 10 15

His Ser Met Lys Glu Met Ile Gly Gly Cys Cys Val Cys Ser Asp
20 25 30

Glu Arg Gly Trp Ala Glu Asn Pro Leu Val Tyr Cys Asp Gly His
35 40 45

Gly Cys Ser Val Ala Val His Gln Ala Cys Tyr Gly Ile Val Gln
50 55 60

Val Pro Thr Gly Pro Trp Phe Cys Arg Lys Cys Glu Ser Gln Glu
65 70 75

Arg Ala Ala Arg Val Arg Cys Glu Leu Cys Pro His Lys Asp Gly 
80 85 90 

Ala Leu Lys Arg Thr Asp Asn Gly Gly Trp Ala His Val Val Cys 
95 100 105 

Ala Leu Tyr Ile Pro Glu Val Gln Phe Ala Asn Val Ser Thr Met
110 115 120

Glu Pro Ile Val Leu Gln Ser Val Pro His Asp Arg Tyr Asn Lys
125 130 135

Thr Cys Tyr Ile Cys Asp Glu Gln Gly Arg Glu Ser Lys Ala Ala
140 145 150

Thr Gly Ala Cys Met Thr Cys Asn Lys His Gly Cys Arg Gln Ala
155 160 165

Phe His Val Thr Cys Ala Gln Phe Ala Gly Leu Leu Cys Glu Glu
170 175 180

Glu Gly Asn Gly Ala Asp Asn Val Gln Tyr Cys Gly Tyr Cys Lys
185 190 195

Tyr His Phe Ser Lys Leu Lys Lys Ser Lys Arg Gly Ser Asn Arg
200 205 210

Ser Tyr Asp Gln Ser Leu Ser Asp Ser Ser Ser His Ser Gln Asp
215 220 225

Lys His His Glu Lys Glu Lys Lys Lys Tyr Lys Glu Lys Asp Lys
230 235 240

His Lys Gln Lys His Lys Lys Gln Pro Glu Pro Ser Pro Ala Leu
245 250 255

Val Pro Ser Leu Thr Val Thr Thr Glu Lys Thr Tyr Thr Ser Thr
260 265 270

Ser Asn Asn Ser Ile Ser Gly Ser Leu Lys Arg Leu Glu Asp Thr
275 280 285

Thr Ala Arg Phe Thr Asn Ala Asn Phe Gln Glu Val Ser Ala His
290 295 300

Thr Ser Ser Gly Lys Asp Val Ser Glu Thr Arg Gly Ser Glu Gly
305 310 315

Lys Gly Lys Lys Ser Ser Ala His Ser Ser Gly Gln Arg Gly Arg
320 325 330

Lys Pro Gly Gly Gly Arg Asn Pro Gly Thr Thr Val Ser Ala Ala 

335 340 345 

Ser Pro Phe Pro Gln Gly Ser Phe Ser Gly Thr Pro Gly Ser Val
350 355 360

Lys Ser Ser Ser Gly Ser Ser Val Gln Ser Pro Gln Asp Phe Leu
365 370 375

Ser Phe Thr Asp Ser Asp Leu Arg Asn Asp Ser Tyr Ser His Ser
380 385 390

Gln Gln Ser Ser Ala Thr Lys Asp Val His Lys Gly Glu Ser Gly
395 400 405

Ser Gln Glu Gly Gly Val Asn Ser Phe Ser Thr Leu Ile Gly Leu
410 415 420

Pro Ser Thr Ser Ala Val Thr Ser Gln Pro Lys Ser Phe Glu Asn
425 430 435

Ser Pro Gly Asp Leu Gly Asn Ser Ser Leu Pro Thr Ala Gly Tyr
440 445 450

Lys Arg Ala Gln Thr Ser Gly Ile Glu Glu Glu Thr Val Lys Glu
455 460 465

Lys Lys Arg Lys Gly Asn Lys Gln Ser Lys His Gly Pro Gly Arg
470 475 480

Pro Lys Gly Asn Lys Asn Gln Glu Asn Val Ser His Leu Ser Val
485 490 495

Ser Ser Ala Ser Pro Thr Ser Ser Val Ala Ser Ala Ala Gly Ser 

500 505 510 

Ile Thr Ser Ser Ser Leu Gln Lys Ser Pro Thr Leu Leu Arg Asn
515 520 525

Gly Ser Leu Gln Ser Leu Ser Val Gly Ser Ser Pro Val Gly Ser
530 535 540

Glu Ile Ser Met Gln Tyr Arg His Asp Gly Ala Cys Pro Thr Thr
545 550 555

Thr Phe Ser Glu Leu Leu Asn Ala Ile His Asn Gly Ile Tyr Asn
560 565 570

Ser Asn Asp Val Ala Val Ser Phe Pro Asn Val Val Ser Gly Ser
575 580 585

Gly Ser Ser Thr Pro Val Ser Ser Ser His Leu Pro Gln Gln Ser
590 595 600

Ser Gly His Leu Gln Gln Val Gly Ala Leu Ser Pro Ser Ala Val
605 610 615

Ser Ser Ala Ala Pro Ala Val Ala Thr Thr Gln Ala Asn Thr Leu
620 625 630

Ser Gly Ser Ser Leu Ser Gln Ala Pro Ser His Met Tyr Gly Asn
635 640 645

Arg Ser Asn Ser Ser Met Ala Ala Leu Ile Ala Gln Ser Glu Asn
650 655 660

Asn Gln Thr Asp Gln Asp Leu Gly Asp Asn Ser Arg Asn Leu Val
665 670 675

Gly Arg Gly Ser Ser Pro Arg Gly Ser Leu Ser Pro Arg Ser Pro 

680 685 690 

Val Ser Ser Leu Gln Ile Arg Tyr Asp Gln Pro Gly Asn Ser Ser
695 700 705

Leu Glu Asn Leu Pro Pro Val Ala Ala Ser Ile Glu Gln Leu Leu
710 715 720

Glu Arg Gln Trp Ser Glu Gly Gln Gln Phe Leu Leu Glu Gln Gly
725 730 735

Thr Pro Ser Asp Ile Leu Gly Met Leu Lys Ser Leu His Gln Leu
740 745 750

Gln Val Glu Asn Arg Arg Leu Glu Glu Gln Ile Lys Asn Leu Thr
755 760 765

Ala Lys Lys Glu Arg Leu Gln Leu Leu Asn Ala Gln Leu Ser Val
770 775 780

Pro Phe Pro Thr Ile Thr Ala Asn Pro Ser Pro Ser His Gln Ile
785 790 795

His Thr Phe Ser Ala Gln Thr Ala Pro Thr Thr Asp Ser Leu Asn
800 805 810

Ser Ser Lys Ser Pro His Ile Gly Asn Ser Phe Leu Pro Asp Asn
815 820 825

Ser Leu Pro Val Leu Asn Gln Asp Leu Thr Ser Ser Gly Gln Ser
830 835 840

Thr Ser Ser Ser Ser Ala Leu Ser Thr Pro Pro Pro Ala Gly Gln
845 850 855

Ser Pro Ala Gln Gln Gly Ser Gly Val Ser Gly Val Gln Gln Val
860 865 870

Asn Gly Val Thr Val Gly Ala Leu Ala Ser Gly Met Gln Pro Val
875 880 885

Thr Ser Thr Ile Pro Ala Val Ser Ala Val Gly Gly Ile Ile Gly
890 895 900

Ala Leu Pro Gly Asn Gln Leu Ala Ile Asn Gly Ile Val Gly Ala
905 910 915

Leu Asn Gly Val Met Gln Thr Pro Val Thr Met Ser Gln Asn Pro
920 925 930

Thr Pro Leu Thr His Thr Thr Val Pro Pro Asn Ala Thr His Pro 

935 940 945 

Met Pro Ala Thr Leu Thr Asn Ser Ala Ser Gly Leu Gly Leu Leu 

950 955 960 

Ser Asp Gln Gln Arg Gln Ile Leu Ile His Gln Gln Gln Phe Gln
965 970 975

Gln Leu Leu Asn Ser Gln Gln Leu Thr Pro Val His Arg His Pro
980 985 990

His Phe Thr Gln Leu Pro Pro Thr His Phe Ser Pro Ser Met Glu
995 1000 1005

Ile Met Gln Val Arg Lys
1010

<210> 35
<211> 1675
<212> PRT
<213> Homo sapiens

<220>
<221> misc_feature
<223> Incyte ID No: 7376372CD1

<400> 35
Met Thr Ser Ser Ser Pro Ala Gly Leu Glu Gly Ser Asp Leu Ser
1 5 10 15

Ser Ile Asn Thr Met Met Ser Ala Val Met Ser Val Gly Lys Val
20 25 30

Thr Glu Asn Gly Gly Ser Pro Gln Gly Ile Lys Ser Pro Ser Lys
35 40 45

Pro Pro Gly Pro Asn Arg Ile Gly Arg Arg Asn Gln Glu Thr Lys
50 55 60

Glu Glu Lys Ser Ser Tyr Asn Cys Pro Leu Cys Glu Lys Ile Cys
65 70 75

Thr Thr Gln His Gln Leu Thr Met His Ile Arg Gln His Asn Thr
80 85 90

Asp Thr Gly Gly Ala Asp His Ser Cys Ser Ile Cys Gly Lys Ser
95 100 105

Leu Ser Ser Ala Ser Ser Leu Asp Arg His Met Leu Val His Ser
110 115 120

Gly Glu Arg Pro Tyr Lys Cys Thr Val Cys Gly Gln Ser Phe Thr
125 130 135

Thr Asn Gly Asn Met His Arg His Met Lys Ile His Glu Lys Asp
140 145 150

Pro Asn Ser Ala Thr Ala Thr Ala Pro Pro Ser Pro Leu Lys Arg 

155 160 165 

Arg Arg Leu Ser Ser Lys Arg Lys Leu Ser His Asp Ala Glu Ser 

170 175 180 

Glu Arg Glu Asp Pro Ala Pro Ala Lys Lys Met Val Glu Asp Gly 

185 190 195 

Gln Ser Gly Asp Leu Glu Lys Lys Ala Asp Glu Val Phe His Cys

200 205 210 

Pro Val Cys Phe Lys Glu Phe Val Cys Lys Tyr Gly Leu Glu Thr 

215 220 225 

His Met Glu Thr His Ser Asp Asn Pro Leu Arg Cys Asp Ile Cys

230 235 240 

Cys Val Thr Phe Arg Thr His Arg Gly Leu Leu Arg His Asn Ala 

245 250 255 

Leu Val His Lys Gln Leu Pro Arg Asp Ala Met Gly Arg Pro Phe
260 265 270

Ile Gln Asn Asn Pro Ser Ile Pro Ala Gly Phe His Asp Leu Gly
275 280 285

Phe Thr Asp Phe Ser Cys Arg Lys Phe Pro Arg Ile Ser Gln Ala
290 295 300

Trp Cys Glu Thr Asn Leu Arg Arg Cys Ile Ser Glu Gln His Arg
305 310 315

Phe Val Cys Asp Thr Cys Asp Lys Ala Phe Pro Met Leu Cys Ser 

320 325 330 

Val Ala Leu His Lys Gln Thr His Val Ala Ala Asp Gln Gly Gln
335 340 345

Glu Lys Pro Gln Ala Thr Pro Leu Pro Gly Asp Ala Leu Asp Gln
350 355 360

Lys Gly Phe Leu Ala Leu Leu Gly Leu Gln His Thr Lys Asp Val
365 370 375

Arg Pro Ala Pro Ala Glu Glu Pro Leu Pro Asp Asp Asn Gln Ala
380 385 390

Ile Gln Leu Gln Thr Leu Lys Cys Gln Leu Pro Gln Asp Pro Gly
395 400 405

Cys Thr Asn Leu Leu Ser Leu Ser Pro Phe Glu Ala Ala Ser Leu 

410 415 420 

Gly Gly Ser Leu Thr Val Leu Pro Ala Thr Lys Asp Ser Ile Lys
425 430 435

His Leu Ser Leu Gln Pro Phe Gln Lys Gly Phe Ile Ile Gln Pro
440 445 450

Asp Ser Ser Ile Val Val Lys Pro Ile Ser Gly Glu Ser Ala Ile
455 460 465

Glu Leu Ala Asp Ile Gln Gln Ile Leu Lys Met Ala Ala Ser Ala
470 475 480

Pro Pro Gln Ile Ser Leu Pro Pro Phe Ser Lys Ala Pro Ala Ala
485 490 495

Pro Leu Gln Ala Ile Phe Lys His Met Pro Pro Leu Lys Ala Lys
500 505 510

Pro Leu Val Thr Pro Arg Thr Val Val Gly His Leu His Ala Pro 

515 520 525 

Ala Ser His Gln Arg Gln Gln Ala Ser Arg Leu Tyr Gln Pro Gln
530 535 540

Pro Ala Ala Thr Ala Pro Glu Ala Pro Gln Arg Leu Thr Gly Gly
545 550 555

Gly Leu Gln Arg Pro Pro Ala Ala Val Gln Val Arg Asp Pro Ala
560 565 570

Pro Arg Gly His Ala Ala Leu Pro Ala Ala Gln Pro Arg Ala Glu
575 580 585

Leu Pro Gly Gln Pro Glu Met Lys Thr Gln Leu Glu Gln Asp Ser
590 595 600

Ile Ile Glu Ala Leu Leu Pro Leu Ser Met Glu Ala Lys Ile Lys
605 610 615

Gln Glu Ile Thr Glu Gly Glu Leu Lys Ala Phe Met Thr Ala Pro
620 625 630

Gly Gly Lys Lys Thr Pro Ala Met Arg Lys Val Leu Tyr Pro Cys 

635 640 645 

Arg Phe Cys Asn Gln Val Phe Ala Phe Ser Gly Val Leu Arg Ala
650 655 660

His Val Arg Ser His Leu Gly Ile Ser Pro Tyr Gln Cys Asn Ile
665 670 675

Cys Asp Tyr Ile Ala Ala Asp Lys Ala Ala Leu Ile Arg His Leu
680 685 690

Arg Thr His Ser Gly Glu Arg Pro Tyr Ile Cys Lys Ile Cys His
695 700 705

Tyr Pro Phe Thr Val Lys Ala Asn Cys Glu Arg His Leu Arg Lys 

710 715 720 

Lys His Leu Lys Ala Thr Arg Lys Asp Ile Glu Lys Asn Ile Glu

725 730 735 

Tyr Val Ser Ser Ser Ala Ala Glu Leu Val Asp Ala Phe Cys Ala 

740 745 750 

Pro Asp Thr Val Cys Arg Leu Cys Gly Glu Asp Leu Lys His Tyr 

755 760 765 

Arg Ala Leu Arg Ile His Met Pro Thr His Cys Gly Arg Gly Leu

770 775 780 

Gly Gly Gly His Lys Gly Arg Lys Pro Phe Glu Cys Lys Glu Cys 

785 790 795 

Ser Ala Ala Phe Ala Ala Lys Arg Asn Cys Ile His His Ile Leu
800 805 810

Lys Gln His Leu His Val Pro Glu Gln Asp Ile Glu Ser Tyr Val
815 820 825

Leu Ala Ala Asp Gly Leu Gly Pro Ala Glu Ala Pro Ala Ala Glu 

830 835 840 

Ala Ser Gly Arg Gly Glu Asp Ser Gly Cys Ala Ala Leu Gly Asp 

845 850 855 

Cys Lys Pro Leu Thr Ala Phe Leu Glu Pro Gln Asn Gly Phe Leu
860 865 870

His Arg Gly Pro Thr Gln Pro Pro Pro Pro His Val Ser Ile Lys
875 880 885

Leu Glu Pro Ala Ser Ser Phe Ala Val Asp Phe Asn Glu Pro Leu
890 895 900

Asp Phe Ser Gln Lys Gly Leu Ala Leu Val Gln Val Lys Gln Glu
905 910 915

Asn Ile Ser Phe Leu Ser Pro Ser Ser Leu Val Pro Tyr Asp Cys
920 925 930

Ser Met Glu Pro Ile Asp Leu Ser Ile Pro Lys Asn Phe Arg Lys
935 940 945

Gly Asp Lys Asp Leu Ala Thr Pro Ser Glu Ala Lys Lys Pro Glu 
950 955 960 

Glu Glu Ala Gly Ser Ser Glu Gln Pro Ser Pro Cys Pro Ala Pro
965 970 975

Gly Pro Ser Leu Pro Val Thr Leu Gly Pro Ser Gly Ile Leu Glu
980 985 990

Ser Pro Met Ala Pro Ala Pro Ala Ala Thr Pro Glu Pro Pro Ala 
995 1000 1005 

Gln Pro Leu Gln Gly Pro Val Gln Leu Ala Val Pro Ile Tyr Ser

1010 1015 1020 

Ser Ala Leu Val Ser Ser Pro Pro Leu Val Gly Ser Ser Ala Leu 

1025 1030 1035 

Leu Ser Gly Thr Ala Leu Leu Arg Pro Leu Arg Pro Lys Pro Pro 

1040 1045 1050 

Leu Leu Leu Pro Lys Pro Pro Val Thr Glu Glu Leu Pro Pro Leu 

1055 1060 1065 

Ala Ser Ile Ala Gln Ile Ile Ser Ser Val Ser Ser Ala Pro Thr

1070 1075 1080 

Leu Leu Lys Thr Lys Val Ala Asp Pro Gly Pro Ala Ser Thr Gly 

1085 1090 1095 

Ser Asn Thr Thr Ala Ser Asp Ser Leu Gly Gly Ser Val Pro Lys 

1100 1105 1110 

Ala Ala Thr Thr Ala Thr Pro Ala Ala Thr Thr Ser Pro Lys Glu 

1115 1120 1125 

Ser Ser Glu Pro Pro Ala Pro Ala Thr Gly Pro Glu Ala Ala Ser 

1130 1135 1140 

Pro Thr Glu Gln Gly Pro Ala Gly Thr Ser Lys Lys Arg Gly Arg

1145 1150 1155 

Lys Arg Gly Met Arg Ser Arg Pro Arg Ala Asn Ser Gly Gly Val 

1160 1165 1170 

Asp Leu Tyr Ser Ser Gly Glu Phe Ala Ser Ile Glu Lys Met Leu
1175 1180 1185

Ala Thr Thr Asp Thr Asn Lys Phe Ser Pro Phe Leu Gln Thr Ala
1190 1195 1200

Glu Asp Asn Thr Gln Asp Glu Val Ala Gly Ala Pro Ala Asp His
1205 1210 1215

His Gly Pro Ser Asp Glu Glu Gln Gly Ser Pro Pro Glu Asp Lys
1220 1225 1230

Leu Leu Arg Ala Lys Arg Asn Ser Tyr Thr Asn Cys Leu Gln Lys
1235 1240 1245

Ile Thr Cys Pro His Cys Pro Arg Val Phe Pro Trp Ala Ser Ser
1250 1255 1260

Leu Gln Arg His Met Leu Thr His Thr Asp Ser Gln Ser Asp Ala
1265 1270 1275

Gly Asp Cys Ser Arg Arg Gly Arg Ser Gly Tyr Asp Leu Thr Ser 

1280 1285 1290 

Arg Asp Arg Glu Gln Pro Ser Glu Gly Ala Thr Glu Leu Arg Gln
1295 1300 1305

Val Ala Gly Asp Ala Pro Val Glu Gln Ala Thr Ala Glu Thr Ala
1310 1315 1320

Ser Arg Cys Thr Gly Lys Ser Thr Gly Val Gly Arg Ala Met Ser
1325 1330 1335

Arg Arg Arg Ser Met Ala Leu Arg Arg Ala Leu Gly Thr Pro Thr 

1340 1345 1350 

Arg Lys Arg Thr Arg Arg Ala Thr Arg Ala Trp Thr Trp Thr Ser 

1355 1360 1365 

Pro Pro Ser Ser Trp Thr Ser Ser Trp Arg Arg Ala Thr Arg Arg 

1370 1375 1380 

Gln Ala Gly Gly Ala Ala Ser Gln Glu Gln Lys Leu Ala Cys Asp

1385 1390 1395 

Ala Cys Gly Lys Ser Phe Lys Phe Leu Gly Thr Leu Ser Arg His 

1400 1405 1410 

Arg Lys Ala His Gly Arg Pro Gly Gly Pro Arg Thr Arg Arg Glu 

1415 1420 1425 

Met Ala Pro Ala Leu Gln Arg Arg Gly Pro Ser Pro Pro Leu Asn
1430 1435 1440

Arg Arg Arg Ser Pro Pro Arg Pro Arg Gln Arg Trp Trp Ser Arg
1445 1450 1455

Pro Gly Ser Gly Arg Pro Arg Pro Glu Lys Leu Ala Glu Glu Thr 

1460 1465 1470 

Glu Gly Pro Ser Asp Gly Glu Ser Ala Ala Glu Lys Arg Ser Ser 

1475 1480 1485 

Glu Lys Ser Asp Asp Asp Lys Lys Pro Lys Thr Asp Ser Pro Lys 

1490 1495 1500 

Ser Val Ala Ser Lys Ala Asp Lys Arg Lys Lys Val Cys Ser Val 

1505 1510 1515 

Cys Asn Lys Arg Phe Trp Ser Leu Gln Asp Leu Ser Arg His Met
1520 1525 1530

Arg Ser His Thr Gly Glu Arg Pro Tyr Lys Cys Gln Thr Cys Glu
1535 1540 1545

Arg Thr Phe Thr Leu Lys His Ser Leu Val Arg His Gln Arg Ile
1550 1555 1560

His Gln Lys Ala Arg His Ala Lys His His Gly Lys Asp Ser Asp
1565 1570 1575

Lys Glu Glu Arg Gly Glu Glu Asp Ser Glu Asn Glu Ser Thr His 

1580 1585 1590 

Ser Gly Asn Asn Ala Val Ser Glu Asn Glu Ala Glu Leu Ala Pro 

1595 1600 1605 

Asn Ala Ser Asn His Met Ala Val Thr Arg Ser Arg Lys Glu Gly 

1610 1615 1620 

Leu Ala Ser Ala Thr Lys Asp Cys Ser His Arg Glu Glu Lys Val 

1625 1630 1635 

Thr Gln Gly Gly Arg Leu Ser Leu Ala Arg Val Thr Leu Thr Gln

1640 1645 1650 

Arg Ala Arg Arg Pro Trp Gly Arg Thr Cys Trp Ser Arg Ala Ala 

1655 1660 1665 

Arg Gly Leu Pro Thr Gln Ser Trp His Ser

1670 1675 



<210> 36 

<211> 724 

<212> PRT 

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 2754344CD1 

<400> 36 

Met Pro Val Arg Phe Lys Gly Leu Ser Glu Tyr Gln Arg Asn Phe

1 5 10 15

Leu Trp Lys Lys Ser Tyr Leu Ser Glu Ser Cys Asn Ser Ser Val 

20 25 30 

Gly Arg Lys Tyr Pro Trp Ala Gly Leu Arg Ser Asp Gln Leu Gly
35 40 45

Ile Thr Lys Glu Pro Ser Phe Ile Ser Lys Arg Arg Val Pro Tyr
50 55 60

His Asp Pro Gln Ile Ser Lys Ser Leu Glu Trp Asn Gly Ala Ile
65 70 75

Ser Glu Ser Asn Val Val Ala Ser Pro Glu Pro Glu Ala Pro Glu 

80 85 90 

Thr Pro Lys Ser Gln Glu Ala Glu Gln Lys Asp Val Ile Gln Glu

95 100 105 

Arg Val His Ser Leu Glu Ala Ser Arg Val Pro Lys Arg Thr Arg 

110 115 120 

Ser His Ser Ala Asp Ser Arg Ala Glu Gly Ala Ser Asp Val Glu 

125 130 135 

Asn Asn Glu Gly Val Thr Asn His Thr Pro Val Asn Glu Asn Val 

140 145 150 

Glu Leu Glu His Ser Thr Lys Val Leu Ser Glu Asn Val Asp Asn 

155 160 165 

Gly Leu Asp Arg Leu Leu Arg Lys Lys Ala Gly Leu Thr Val Val 

170 175 180 

Pro Ser Tyr Asn Ala Leu Arg Asn Ser Glu Tyr Gln Arg Gln Phe

185 190 195 

Val Trp Lys Thr Ser Lys Glu Thr Ala Pro Ala Phe Ala Ala Asn 

200 205 210 

Gln Val Phe His Asn Lys Ser Gln Phe Val Pro Pro Phe Lys Gly
215 220 225

Asn Ser Val Ile His Glu Thr Glu Tyr Lys Arg Asn Phe Lys Gly
230 235 240

Leu Ser Pro Val Lys Glu Pro Lys Leu Arg Asn Asp Leu Arg Glu 

245 250 255 

Asn Arg Asn Leu Glu Thr Val Ser Pro Glu Arg Lys Ser Asn Lys 

260 265 270 

Ile Asp Asp Arg Leu Lys Leu Glu Ala Glu Met Glu Leu Lys Asp
275 280 285

Leu His Gln Pro Lys Arg Lys Leu Thr Pro Trp Lys His Gln Arg
290 295 300

Leu Gly Lys Val Asn Ser Glu Tyr Arg Ala Lys Phe Leu Ser Pro 

305 310 315 

Ala Gln Tyr Leu Tyr Lys Ala Gly Ala Trp Thr His Val Lys Gly
320 325 330

Asn Met Pro Asn Gln Gly Ser Leu Asn Ala Met Trp Tyr Ala Glu
335 340 345

Val Lys Glu Leu Arg Glu Lys Ala Glu Phe Tyr Arg Lys Arg Val 

350 355 360 

Gln Gly Thr His Phe Ser Arg Asp His Leu Asn Gln Ile Leu Ser

365 370 375 

Asp Ser Asn Cys Cys Trp Asp Val Ser Ser Thr Thr Ser Ser Glu

380 385 390

Gly Thr Ile Ser Ser Asn Ile Arg Ala Leu Asp Leu Ala Gly Asp

395 400 405

Pro Thr Ser His Lys Thr Leu Gln Lys Cys Pro Ser Thr Glu Pro

410 415 420

Glu Glu Lys Gly Asn Ile Val Glu Glu Gln Pro Gln Lys Asn Thr

425 430 435

Thr Glu Lys Leu Gly Val Ser Ala Pro Thr Ile Pro Val Arg Arg

440 445 450

Arg Leu Ala Trp Asp Thr Glu Asn Thr Ser Glu Asp Val Gln Lys

455 460 465

Gln Pro Gly Glu Lys Glu Glu Glu Asp Asp Asn Glu Glu Glu Gly

470 475 480

Asp Arg Lys Thr Gly Lys Gln Ala Phe Met Gly Glu Gln Glu Lys

485 490 495

Leu Asp Val His Glu Lys Ser Lys Ala Asp Lys Met Lys Glu Gly

500 505 510

Ser Asp Ser Ser Val Ser Ser Glu Lys Gly Gly Arg Leu Pro Thr

515 520 525

Pro Lys Leu Arg Glu Leu Gly Gly Ile Gln Arg Thr His His Asp

530 535 540

Leu Thr Thr Pro Ala Val Gly Gly Ala Val Leu Val Ser Pro Ser

545 550 555

Lys Met Lys Pro Pro Ala Leu Glu Gln Arg Lys Arg Met Thr Ser

560 565 570

Gln Asp Cys Leu Glu Thr Ser Lys Asn Asp Phe Thr Lys Lys Glu

575 580 585

Ser Arg Ala Val Ser Leu Leu Thr Ser Pro Ala Ala Gly Ile Lys

590 595 600

Thr Val Asp Pro Leu Pro Leu Arg Glu Asp Ser Glu Asp Asn Ile

605 610 615

His Lys Phe Ala Glu Ala Thr Leu Pro Val Ser Lys Ile Pro Lys

620 625 630

Tyr Pro Thr Asn Pro Pro Gly Gln Leu Pro Ser Pro Pro His Val

635 640 645

Pro Ser Tyr Trp His Pro Ser Arg Arg Ile Gln Gly Ser Leu Arg

650 655 660

Asp Pro Glu Phe Gln His Asn Val Gly Lys Ala Arg Met Asn Asn

665 670 675

Leu Gln Leu Pro Gln His Glu Ala Phe Asn Asp Glu Asp Glu Asp

680 685 690

Arg Leu Ser Glu Ile Ser Ala Arg Ser Ala Ala Ser Ser Leu Arg

695 700 705

Ala Phe Gln Thr Leu Ala Arg Ala Lys Lys Arg Lys Glu Asn Phe

710 715 720

Trp Gly Lys Thr



<210> 37 
<211> 605 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 8268822CD1
<400> 37

Met Glu Ile Gly Thr Glu Ile Ser Arg Lys Ile Arg Ser Ala Ile
1 5 10 15

Lys Gly Lys Leu Gln Glu Leu Gly Ala Tyr Val Asp Glu Glu Leu
20 25 30

Pro Asp Tyr Ile Met Val Met Val Ala Asn Lys Lys Ser Gln Asp
35 40 45

Gln Met Thr Glu Asp Leu Ser Leu Phe Leu Gly Asn Asn Thr Ile
50 55 60

Arg Phe Thr Val Trp Leu His Gly Val Leu Asp Lys Leu Arg Ser 
65 70 75 

Val Thr Thr Glu Pro Ser Ser Leu Lys Ser Ser Asp Thr Asn Ile
80 85 90 

Phe Asp Ser Asn Val Pro Ser Asn Lys Ser Asn Phe Ser Arg Gly 
95 100 105 

Asp Glu Arg Arg His Glu Ala Ala Val Pro Pro Leu Ala Ile Pro

110 115 120 

Ser Ala Arg Pro Glu Lys Arg Asp Ser Arg Val Ser Thr Ser Ser 

125 130 135 

Gln Glu Ser Lys Thr Thr Asn Val Arg Gln Thr Tyr Asp Asp Gly

140 145 150 

Ala Ala Thr Arg Leu Met Ser Thr Val Lys Pro Leu Arg Glu Pro 

155 160 165 

Ala Pro Ser Glu Asp Val Ile Asp Ile Lys Pro Glu Pro Asp Asp
170 175 180

Leu Ile Asp Glu Asp Leu Asn Phe Val Gln Glu Asn Pro Leu Ser
185 190 195

Gln Lys Lys Pro Thr Val Thr Leu Thr Tyr Gly Ser Ser Arg Pro
200 205 210

Ser Ile Glu Ile Tyr Arg Pro Pro Ala Ser Arg Asn Ala Asp Ser
215 220 225

Gly Val His Leu Asn Arg Leu Gln Phe Gln Gln Gln Gln Asn Ser
230 235 240

Ile His Ala Ala Lys Gln Leu Asp Met Gln Ser Ser Trp Val Tyr
245 250 255

Glu Thr Gly Arg Leu Cys Glu Pro Glu Val Leu Asn Ser Leu Glu 

260 265 270 

Glu Thr Tyr Ser Pro Phe Phe Arg Asn Asn Ser Glu Lys Met Ser 

275 280 285 

Met Glu Asp Glu Asn Phe Arg Lys Arg Lys Leu Pro Val Val Ser 

290 295 300 

Ser Val Val Lys Val Lys Lys Phe Asn His Asp Gly Glu Glu Glu 

305 310 315 

Glu Glu Asp Asp Asp Tyr Gly Ser Arg Thr Gly Ser Ile Ser Ser

320 325 330 

Ser Val Ser Val Pro Ala Lys Pro Glu Arg Arg Pro Ser Leu Pro 

335 340 345 

Pro Ser Lys Gln Ala Asn Lys Asn Leu Ile Leu Lys Ala Ile Ser
350 355 360

Glu Ala Gln Glu Ser Val Thr Lys Thr Thr Asn Tyr Ser Thr Val
365 370 375

Pro Gln Lys Gln Thr Leu Pro Val Ala Pro Arg Thr Arg Thr Ser
380 385 390

Gln Glu Glu Leu Leu Ala Glu Val Val Gln Gly Gln Ser Arg Thr
395 400 405

Pro Arg Ile Ser Pro Pro Ile Lys Glu Glu Glu Thr Lys Gly Asp
410 415 420


Ser Val Glu Lys Asn Gln Gly Thr Gln Gln Arg Gln Leu Leu
425 430 435


Arg Leu Gln Ile Asp Pro Val Met Ala Glu Thr Leu Gln Met Ser
440 445 450

Gln Ala Glu Met Ser Glu Leu Ser Val Ala Gln Lys Pro Glu
455 460 465

Leu Leu Glu Arg Cys Lys Tyr Trp Pro Ala Ala Cys Lys Asn Gly Asp
470 475 480


Glu Cys Ala Tyr His His Pro Ile Ser Pro Cys Lys Ala Phe Pro
485 490 495

Asn Cys Lys Phe Ala Glu Lys Cys Leu Phe Val His Pro Asn Cys
500 505 510

Lys Tyr Asp Ala Lys Cys Thr Lys Pro Asp Cys Pro Phe Thr His
515 520 525

Val Ser Arg Arg Ile Pro Val Leu Ser Pro Lys Pro Ala Val Ala
530 535 540

Pro Pro Ala Pro Pro Ser Ser Ser Gln Leu Cys Arg Tyr Phe Pro
545 550 555

Ala Cys Lys Lys Met Glu Cys Pro Phe Tyr His Pro Lys His Cys
560 565 570

Arg Phe Asn Thr Gln Cys Thr Arg Pro Asp Cys Thr Phe Tyr His
575 580 585

Pro Thr Ile Asn Val Pro Pro Arg His Ala Leu Lys Trp Ile Arg
590 595 600

Pro Gln Thr Ser Glu
605























<210> 38 

<211> 865 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1814553CD1 



<400> 38 

Met Thr Glu Val Lys Val Lys Thr Glu Leu Pro Asp Asp Tyr Ile
1 5 10 15

Gln Glu Val Ile Trp Gln Gly Glu Ala Lys Glu Glu Lys Lys Ala
20 25 30

Val Ser Lys Asp Gly Thr Ser Asp Val Pro Ala Glu Ile Cys Val
35 40 45

Val Ile Gly Gly Val Arg Asn Gln Gln Thr Leu Gly Ser Tyr Glu
50 55 60

Cys Gly Ile Cys Gly Lys Lys Tyr Lys Tyr Tyr Asn Cys Phe Gln
65 70 75

Thr His Val Arg Ala His Arg Asp Thr Glu Ala Thr Ser Gly Glu 

80 85 90 

Gly Ala Ser Gln Ser Asn Asn Phe Arg Tyr Thr Cys Asp Ile Cys
95 100 105

Gly Lys Lys Tyr Lys Tyr Tyr Ser Cys Phe Gln Glu His Arg Asp
110 115 120

Leu His Ala Val Asp Val Phe Ser Val Glu Gly Ala Pro Glu Asn
125 130 135

Arg Ala Asp Pro Phe Asp Gln Gly Val Val Ala Thr Asp Glu Val
140 145 150

Lys Glu Glu Pro Pro Glu Pro Phe Gln Lys Ile Gly Pro Lys Thr
155 160 165

Gly Asn Tyr Thr Cys Glu Phe Cys Gly Lys Gln Tyr Lys Tyr Tyr
170 175 180

Thr Pro Tyr Gln Glu His Val Ala Leu His Ala Pro Ile Ser Thr
185 190 195

Ala Pro Gly Trp Glu Pro Pro Asp Asp Pro Asp Thr Gly Ser Glu 

200 205 210 

Cys Ser His Pro Glu Val Ser Pro Ser Pro Arg Phe Val Ala Ala 

215 220 225 

Lys Thr Gln Thr Asn Gln Ser Gly Lys Lys Ala Pro Ala Ser Val
230 235 240

Val Arg Cys Ala Thr Leu Leu His Arg Thr Pro Pro Ala Thr Gln
245 250 255

Thr Gln Thr Phe Arg Thr Pro Asn Ser Gly Ser Pro Ala Ser Lys
260 265 270

Ala Thr Ala Glu Ser Ala Phe Ser Arg Arg Val Glu Gly Lys Ala 

275 280 285 

Gln Asn His Phe Glu Glu Thr Asn Ser Ser Ser Gln Asn Ser Ser
290 295 300

Glu Pro Tyr Thr Cys Gly Ala Cys Gly Ile Gln Phe Gln Phe Tyr
305 310 315

Asn Asn Leu Leu Glu His Met Gln Ser His Ala Ala Asp Asn Glu
320 325 330

Asn Asn Ile Ala Ser Asn Gln Ser Arg Ser Pro Pro Ala Val Val
335 340 345

Glu Glu Lys Trp Lys Pro Gln Ala Gln Arg Asn Ser Ala Asn Asn
350 355 360

Thr Thr Thr Ser Gly Leu Thr Pro Asn Ser Met Ile Pro Glu Lys
365 370 375

Glu Arg Gln Asn Ile Ala Glu Arg Leu Leu Arg Val Met Cys Ala
380 385 390

Asp Leu Gly Ala Leu Ser Val Val Ser Gly Lys Glu Phe Leu Lys 

395 400 405 

Leu Ala Gln Thr Leu Val Asp Ser Gly Ala Arg Tyr Gly Ala Phe
410 415 420

Ser Val Thr Glu Ile Leu Gly Asn Phe Asn Thr Leu Ala Leu Lys
425 430 435

His Leu Pro Arg Met Tyr Asn Gln Val Lys Val Lys Val Thr Cys
440 445 450

Ala Leu Gly Ser Asn Ala Cys Leu Gly Ile Gly Val Thr Cys His
455 460 465

Ser Gln Ser Val Gly Pro Asp Ser Cys Tyr Ile Leu Thr Ala Tyr
470 475 480

Gln Ala Glu Gly Asn His Ile Lys Ser Tyr Val Leu Gly Val Lys
485 490 495

Gly Ala Asp Ile Arg Asp Ser Gly Asp Leu Val His His Trp Val
500 505 510

Gln Asn Val Leu Ser Glu Phe Val Met Ser Glu Ile Arg Thr Val
515 520 525

Tyr Val Thr Asp Cys Arg Val Ser Thr Ser Ala Phe Ser Lys Ala 

530 535 540 

Gly Met Cys Leu Arg Cys Ser Ala Cys Ala Leu Asn Ser Val Val

545 550 555

Gln Ser Val Leu Ser Lys Arg Thr Leu Gln Ala Arg Ser Met His

560 565 570

Glu Val Ile Glu Leu Leu Asn Val Cys Glu Asp Leu Ala Gly Ser

575 580 585

Thr Gly Leu Ala Lys Glu Thr Phe Gly Ser Leu Glu Glu Thr Ser

590 595 600

Pro Pro Pro Cys Trp Asn Ser Val Thr Asp Ser Leu Leu Leu Val

605 610 615

His Glu Arg Tyr Glu Gln Ile Cys Glu Phe Tyr Ser Arg Ala Lys

620 625 630

Lys Met Asn Leu Ile Gln Ser Leu Asn Lys His Leu Leu Ser Asn

635 640 645

Leu Ala Ala Ile Leu Thr Pro Val Lys Gln Ala Val Ile Glu Leu

650 655 660

Ser Asn Glu Ser Gln Pro Thr Leu Gln Leu Val Leu Pro Thr Tyr

665 670 675

Val Arg Leu Glu Lys Leu Phe Thr Ala Lys Ala Asn Asp Ala Gly

680 685 690

Thr Val Ser Lys Leu Cys His Leu Phe Leu Glu Ala Leu Lys Glu

695 700 705

Asn Phe Lys Val His Pro Ala His Lys Val Ala Met Ile Leu Asp

710 715 720

Pro Gln Gln Lys Leu Arg Pro Val Pro Pro Tyr Gln His Glu Glu

725 730 735

Ile Ile Gly Lys Val Cys Glu Leu Ile Asn Glu Val Lys Glu Ser

740 745 750

Trp Ala Glu Glu Ala Asp Phe Glu Pro Ala Ala Lys Lys Pro Arg

755 760 765

Ser Ala Ala Val Glu Asn Pro Ala Ala Gln Glu Asp Asp Arg Leu

770 775 780

Gly Lys Asn Glu Val Tyr Asp Tyr Leu Gln Glu Pro Leu Phe Gln

785 790 795

Ala Thr Pro Asp Leu Phe Gln Tyr Trp Ser Cys Val Thr Gln Lys

800 805 810

His Thr Lys Leu Ala Lys Leu Ala Phe Trp Leu Leu Ala Val Pro

815 820 825

Ala Val Gly Ala Arg Ser Gly Cys Val Asn Met Cys Glu Gln Ala

830 835 840

Leu Leu Ile Lys Arg Arg Arg Leu Leu Ser Pro Glu Asp Met Asn

845 850 855

Lys Leu Met Phe Leu Lys Ser Asn Met Leu

860 865


<210> 39

<211> 219

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 71217830CD1

<400> 39

Met Tyr Ile Tyr Thr Phe Leu Ser His Leu Val Lys Cys Leu Lys

1 5 10 15

Ile Arg Val Ile Phe Asn Leu Ile Phe Leu Ile Leu Gly Lys Tyr

20 25 30

Ile Ala Ser Thr Gln Arg Pro Asp Gly Thr Trp Arg Lys Gln Arg

35 40 45

Arg Val Lys Glu Gly Tyr Val Pro Gln Glu Glu Val Pro Val Tyr

50 55 60

Glu Asn Lys Tyr Val Lys Phe Phe Lys Ser Lys Pro Glu Leu Pro

65 70 75

Pro Gly Leu Ser Pro Glu Ala Thr Ala Pro Val Thr Pro Ser Arg

80 85 90

Pro Glu Gly Gly Glu Pro Gly Leu Ser Lys Thr Ala Lys Arg Asn

95 100 105

Leu Lys Arg Lys Glu Lys Arg Arg Gln Gln Gln Glu Lys Gly Glu

110 115 120

Ala Glu Ala Leu Ser Arg Thr Leu Asp Lys Val Ser Leu Glu Glu

125 130 135

Thr Ala Gln Leu Pro Ser Ala Pro Gln Gly Ser Arg Ala Ala Pro

140 145 150

Thr Ala Ala Ser Asp Gln Pro Asp Ser Ala Ala Thr Thr Glu Lys

155 160 165

Ala Lys Lys Ile Lys Asn Leu Lys Lys Lys Leu Arg Gln Val Glu

170 175 180

Glu Leu Gln Gln Arg Ile Gln Ala Gly Glu Val Ser Gln Pro Ser

185 190 195

Lys Glu Gln Leu Glu Lys Leu Ala Arg Arg Arg Ala Leu Glu Glu

200 205 210

Glu Leu Glu Asp Leu Glu Leu Gly Leu

215


<210> 40
<211> 1144
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7506252CD1

<400> 40

Met Glu Arg Leu Arg Asp Val Arg Glu Arg Leu Gln Ala Trp Glu

1 5 10 15

Arg Ala Phe Arg Arg Gln Arg Gly Arg Arg Pro Ser Gln Asp Asp

20 25 30

Val Glu Ala Ala Pro Glu Glu Thr Arg Ala Leu Tyr Arg Glu Tyr

35 40 45

Arg Thr Leu Lys Arg Thr Thr Gly Gln Ala Gly Gly Gly Leu Arg

50 55 60

Ser Ser Glu Ser Leu Pro Ala Ala Ala Glu Glu Ala Pro Glu Pro

65 70 75

Arg Cys Trp Gly Pro His Leu Asn Arg Ala Ala Thr Lys Ser Pro

80 85 90

Gln Pro Thr Pro Gly Arg Ser Arg Gln Gly Ser Val Pro Asp Tyr

95 100 105

Gly Gln Arg Leu Lys Ala Asn Leu Lys Gly Thr Leu Gln Ala Gly

110 115 120

Pro Ala Leu Gly Arg Arg Pro Trp Pro Leu Gly Arg Ala Ser Ser

125 130 135

Lys Ala Ser Thr Pro Lys Pro Pro Gly Thr Gly Pro Val Pro Ser

140 145 150

Phe Ala Glu Lys Val Ser Asp Glu Pro Pro Gln Leu Pro Glu Pro

155 160 165

Gln Pro Arg Pro Gly Arg Leu Gln His Leu Gln Ala Ser Leu Ser

170 175 180

Gln Arg Leu Gly Ser Leu Asp Pro Gly Trp Leu Gln Arg Cys His

185 190 195

Ser Glu Val Pro Asp Phe Leu Gly Ala Pro Lys Ala Cys Arg Pro

200 205 210

Asp Leu Gly Ser Glu Glu Ser Gln Leu Leu Ile Pro Gly Glu Ser

215 220 225

Ala Val Leu Gly Pro Gly Ala Gly Ser Gln Gly Pro Glu Ala Ser

230 235 240

Ala Phe Gln Glu Val Ser Ile Arg Val Gly Ser Pro Gln Pro Ser

245 250 255

Ser Ser Gly Gly Glu Lys Arg Arg Trp Asn Glu Glu Pro Trp Glu

260 265 270

Ser Pro Ala Gln Val Gln Gln Glu Ser Ser Gln Ala Gly Pro Pro

275 280 285

Ser Glu Gly Ala Gly Ala Val Ala Val Glu Glu Asp Pro Pro Gly

290 295 300

Glu Pro Val Gln Ala Gln Pro Pro Gln Pro Cys Ser Ser Pro Ser

305 310 315

Asn Pro Arg Tyr His Gly Leu Ser Pro Ser Ser Gln Ala Arg Ala

320 325 330

Gly Lys Ala Glu Gly Thr Ala Pro Leu His Ile Phe Pro Arg Leu

335 340 345

Ala Arg His Asp Arg Gly Asn Tyr Val Arg Leu Asn Met Lys Gln

350 355 360

Lys His Tyr Val Arg Gly Arg Ala Leu Arg Ser Arg Leu Leu Arg

365 370 375

Lys Gln Ala Trp Lys Gln Lys Trp Arg Lys Lys Gly Glu Cys Phe

380 385 390

Gly Gly Gly Gly Ala Thr Val Thr Thr Lys Glu Ser Cys Phe Leu

395 400 405

Asn Glu Gln Phe Asp His Trp Ala Ala Gln Cys Pro Arg Pro Ala

410 415 420

Ser Glu Glu Asp Thr Asp Ala Val Gly Pro Glu Pro Leu Val Pro

425 430 435

Ser Pro Gln Pro Val Pro Glu Val Pro Ser Leu Asp Pro Thr Val

440 445 450

Leu Pro Leu Tyr Ser Leu Gly Pro Ser Gly Gln Leu Ala Glu Thr

455 460 465

Pro Ala Glu Val Phe Gln Ala Leu Glu Gln Leu Gly His Gln Ala

470 475 480

Phe Arg Pro Gly Gln Glu Arg Ala Val Met Arg Ile Leu Ser Gly

485 490 495

Ile Ser Thr Leu Leu Val Leu Pro Thr Gly Ala Gly Lys Ser Leu

500 505 510

Cys Tyr Gln Leu Pro Ala Leu Leu Tyr Ser Arg Arg Ser Pro Cys

515 520 525

Leu Thr Leu Val Val Ser Pro Leu Leu Ser Leu Met Asp Asp Gln

530 535 540

Val Ser Gly Leu Pro Pro Cys Leu Lys Ala Ala Cys Ile His Ser

545 550 555






Gly Met Thr Arg Lys Gln Arg Glu Ser Val Leu Gln Lys Ile Arg

560 565 570

Ala Ala Gln Val His Val Leu Met Leu Thr Pro Glu Ala Leu Val

575 580 585

Gly Ala Gly Gly Leu Pro Pro Ala Ala Gln Leu Pro Pro Val Ala

590 595 600

Phe Ala Cys Ile Asp Glu Ala His Cys Leu Ser Gln Trp Ser His

605 610 615

Asn Phe Arg Pro Cys Tyr Leu Arg Val Cys Lys Val Leu Arg Glu

620 625 630

Arg Met Gly Val His Cys Phe Leu Gly Leu Thr Ala Thr Ala Thr

635 640 645

Arg Arg Thr Ala Ser Asp Val Ala Gln His Leu Ala Val Ala Glu

650 655 660

Glu Pro Asp Leu His Gly Pro Ala Pro Val Pro Thr Asn Leu His

665 670 675

Leu Ser Val Ser Met Asp Arg Asp Thr Asp Gln Ala Leu Leu Thr

680 685 690

Leu Leu Gln Gly Lys Arg Phe Gln Asn Leu Asp Ser Ile Ile Ile

695 700 705

Tyr Cys Asn Arg Arg Glu Asp Thr Glu Arg Ile Ala Ala Leu Leu

710 715 720

Arg Thr Cys Leu His Ala Ala Trp Val Pro Gly Ser Gly Gly Arg

725 730 735

Ala Pro Lys Thr Thr Ala Glu Ala Tyr His Ala Gly Met Cys Ser

740 745 750

Arg Glu Arg Arg Arg Pro Gln Gly Glu Asp Leu Arg Glu Leu Arg

755 760 765

Arg His Val His Ala Asp Ser Thr Asp Phe Leu Ala Val Lys Arg

770 775 780

Leu Val Gln Arg Val Phe Pro Ala Cys Thr Cys Thr Cys Thr Arg

785 790 795

Pro Pro Ser Glu Gln Glu Gly Ala Val Gly Gly Glu Arg Pro Val

800 805 810

Pro Lys Tyr Pro Pro Gln Glu Ala Glu Gln Leu Ser His Gln Ala

815 820 825

Ala Pro Gly Pro Arg Arg Val Cys Met Gly His Glu Arg Ala Leu

830 835 840

Pro Ile Gln Leu Thr Val Gln Ala Leu Asp Met Pro Glu Glu Ala

845 850 855

Ile Glu Thr Leu Leu Cys Tyr Leu Glu Leu His Pro His His Trp

860 865 870

Leu Glu Leu Leu Ala Thr Thr Tyr Thr His Cys Arg Leu Asn Cys

875 880 885

Pro Gly Gly Pro Ala Gln Leu Gln Ala Leu Ala His Arg Cys Pro

890 895 900

Pro Leu Ala Val Cys Leu Ala Gln Gln Leu Pro Glu Asp Pro Gly

905 910 915

Gln Gly Ser Ser Ser Val Glu Phe Asp Met Val Lys Leu Val Asp

920 925 930

Ser Met Gly Trp Glu Leu Ala Ser Val Arg Arg Ala Leu Cys Gln

935 940 945

Leu Gln Trp Asp His Glu Pro Arg Thr Gly Val Arg Arg Gly Thr

950 955 960

Gly Val Leu Val Glu Phe Ser Glu Leu Ala Phe His Leu Arg Ser

965 970 975

Pro Gly Asp Leu Thr Ala Glu Glu Lys Asp Gln Ile Cys Asp Phe

980 985 990

Leu Tyr Gly Arg Val Gln Ala Arg Glu Arg Gln Ala Leu Ala Arg

995 1000 1005

Leu Arg Arg Thr Phe Gln Ala Phe His Ser Val Ala Phe Pro Ser

1010 1015 1020

Cys Gly Pro Cys Leu Glu Gln Gln Asp Glu Glu Arg Ser Thr Arg

1025 1030 1035

Leu Lys Asp Leu Leu Gly Arg Tyr Phe Glu Glu Glu Glu Gly Gln

1040 1045 1050

Glu Pro Gly Gly Met Glu Asp Ala Gln Gly Pro Glu Pro Gly Gln

1055 1060 1065

Ala Arg Leu Gln Asp Trp Glu Asp Gln Val Arg Cys Asp Ile Arg

1070 1075 1080

Gln Phe Leu Ser Leu Arg Pro Glu Glu Lys Phe Ser Ser Arg Ala

1085 1090 1095

Val Ala Arg Ile Phe His Gly Ile Gly Ser Pro Cys Tyr Pro Ala

1100 1105 1110

Gln Val Tyr Gly Gln Asp Arg Arg Phe Trp Arg Lys Tyr Leu His

1115 1120 1125

Leu Ser Phe His Ala Leu Val Gly Leu Ala Thr Glu Glu Leu Leu

1130 1135 1140

Gln Val Ala Arg


<210> 41
<211> 217
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 2270608CD1

<400> 41

Met Ile Asn His Leu Ser Pro His Gln Ala Ala Ala Pro Val Asp

1 5 10 15

Gln Thr Pro Arg Thr Leu Ala Thr Met Gly Gln Arg Ala Leu Pro

20 25 30

Ser Ser Leu Ala Leu Leu Ser Arg Pro Leu Ser Pro Pro Pro Ala

35 40 45

Ala Cys Ser Gly Asp Pro Gly Cys Gly Ser Gly Ala Gly Leu Pro

50 55 60

Ser Ala Ser Ala Ala Ala Gly Ile Ala Ser Ser Ala Val Glu Pro

65 70 75

Val Cys Gly Asp Ala Ala Pro Ala Cys Leu Leu Arg Thr Pro Leu

80 85 90

Arg Gly Leu Leu Lys Pro Thr Gly Pro Arg Ser Thr Met Glu Cys

95 100 105

Pro Pro Ala Leu Ile Val His Pro Pro Ala Gly Gly Met Ala Ser

110 115 120

Gly Ser Ser Gln Pro Trp Ala Ala Ala Ser Ala Thr Pro Met Leu

125 130 135

Ser Ser Lys Ala Ser Leu Cys Ile Pro Thr Arg Gly Pro Pro Pro

140 145 150

Gln Pro Leu Met Arg Thr Pro Ala Ala Arg Ser His Trp Pro Ile

155 160 165

Pro His Pro Cys Asp Thr Ala Cys Pro Ala Pro Leu Pro Val Val

170 175 180

Leu Val Ala Pro Arg Ser Thr Ile Leu Ser Met Ser Arg Thr Trp

185 190 195

Thr Cys Arg Arg Trp Ala Val Ala Pro Cys Arg Ala Glu Lys Leu

200 205 210

Met Cys Ser Ser Ser Arg Ser

215


<210> 42

<211> 309

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7502428CD1

<400> 42

Met Gln Ser Leu Trp Glu Lys Ala Cys Glu Asn Leu Arg Asn Leu

1 5 10 15

Asn Met Glu Thr Thr Arg Thr Arg Cys Trp Lys Asp Tyr Val Ser

20 25 30

Leu Arg Ile Glu Ala Ile Arg Ala Glu Tyr Gln Lys Met Pro Ala

35 40 45

Phe Leu His Glu Glu Glu Gln His His Leu Glu Arg Leu Arg Lys

50 55 60

Glu Gly Glu Asp Ile Phe Gln Gln Leu Asn Glu Ser Lys Ala Arg

65 70 75

Met Glu His Ser Arg Glu Leu Leu Arg Gly Met Tyr Glu Asp Leu

80 85 90

Lys Gln Met Cys His Lys Ala Asp Val Glu Leu Leu Gln Ala Phe

95 100 105

Gly Asp Ile Leu His Arg Tyr Glu Ser Leu Leu Leu Gln Val Ser

110 115 120

Glu Pro Val Asn Pro Glu Leu Ser Ala Gly Pro Ile Thr Gly Leu

125 130 135

Leu Asp Ser Leu Ser Gly Phe Arg Val Asp Phe Thr Leu Gln Pro

140 145 150

Glu Arg Ala Asn Ser His Ile Phe Leu Cys Gly Asp Leu Arg Ser

155 160 165

Met Asn Val Gly Cys Asp Pro Gln Asp Asp Pro Asp Ile Thr Gly

170 175 180

Lys Ser Glu Cys Phe Leu Val Trp Gly Ala Gln Ala Phe Thr Ser

185 190 195

Gly Lys Tyr Tyr Trp Glu Val His Met Gly Asp Ser Trp Asn Trp

200 205 210

Ala Phe Gly Val Cys Asn Asn Tyr Trp Lys Glu Lys Arg Gln Asn

215 220 225

Asp Lys Ile Asp Gly Glu Glu Gly Leu Phe Leu Leu Gly Cys Val

230 235 240

Lys Glu Asp Thr His Cys Ser Leu Phe Thr Thr Thr Pro Leu Val

245 250 255

Val Gln Tyr Val Pro Arg Pro Thr Ser Thr Val Gly Leu Phe Leu

260 265 270

Asp Cys Glu Gly Arg Ile Met Ser Phe Val Asp Val Asp Gln Ser

275 280 285

Phe Leu Ile Tyr Ile Ile Pro Asn Cys Ser Phe Ser Pro Pro Leu

290 295 300

Arg Pro Ile Phe Cys Cys Ser His Phe

305

<210> 43 
<211> 483 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 368741CD1 

<400> 43 

Met Trp Leu Gly Thr Ser Gly Lys Ser Gly Leu Pro Gly His Cys

1 5 10 15

Leu Glu Asn Pro Leu Gln Glu Cys His Pro Ala Gln Leu Glu Glu

20 25 30

Trp Ala Leu Lys Gly Ile Ser Arg Pro Ser Val Ile Ser Gln Pro

35 40 45

Glu Gln Lys Glu Glu Pro Trp Val Leu Pro Leu Gln Asn Phe Glu

50 55 60

Ala Arg Lys Ile Pro Arg Glu Ser His Thr Asp Cys Glu His Gln

65 70 75

Val Ala Lys Leu Asn Gln Asp Asn Ser Glu Thr Ala Glu Gln Cys

80 85 90

Gly Thr Ser Ser Glu Arg Thr Asn Lys Asp Leu Ser His Thr Leu

95 100 105

Ser Trp Gly Gly Asn Trp Glu Gln Gly Leu Glu Leu Glu Gly Gln

110 115 120

Tyr Gly Thr Leu Pro Gly Glu Gly Gln Leu Glu Ser Phe Ser Gln

125 130 135

Glu Arg Asp Leu Asn Lys Leu Leu Asp Gly Tyr Val Gly Glu Lys

140 145 150

Pro Met Cys Ala Glu Cys Gly Lys Ser Phe Asn Gln Ser Ser Tyr

155 160 165

Leu Ile Arg His Leu Arg Thr His Thr Gly Glu Arg Pro Tyr Thr

170 175 180

Cys Ile Glu Cys Gly Lys Gly Phe Lys Gln Ser Ser Asp Leu Val

185 190 195

Thr His Arg Arg Thr His Thr Gly Glu Lys Pro Tyr Gln Cys Lys

200 205 210

Gly Cys Glu Lys Lys Phe Ser Asp Ser Ser Thr Leu Ile Lys His

215 220 225

Gln Arg Thr His Thr Gly Glu Arg Pro Tyr Glu Cys Pro Glu Cys

230 235 240

Gly Lys Thr Phe Gly Arg Lys Pro His Leu Ile Met His Gln Arg

245 250 255

Thr His Thr Gly Glu Lys Pro Tyr Ala Cys Leu Glu Cys His Lys

260 265 270

Ser Phe Ser Arg Ser Ser Asn Phe Ile Thr His Gln Arg Thr His

275 280 285

Thr Gly Val Lys Pro Tyr Arg Cys Asn Asp Cys Gly Glu Ser Phe

290 295 300

Ser Gln Ser Ser Asp Leu Ile Lys His Gln Arg Thr His Thr Gly

305 310 315

Glu Arg Pro Phe Lys Cys Pro Glu Cys Gly Lys Gly Phe Arg Asp

320 325 330

Ser Ser His Phe Val Ala His Met Ser Thr His Ser Gly Glu Arg

335 340 345

Pro Phe Ser Cys Pro Asp Cys His Lys Ser Phe Ser Gln Ser Ser

350 355 360

His Leu Val Thr His Gln Arg Thr His Thr Gly Glu Arg Pro Phe

365 370 375

Lys Cys Glu Asn Cys Gly Lys Gly Phe Ala Asp Ser Ser Ala Leu

380 385 390

Ile Lys His Gln Arg Ile His Thr Gly Glu Arg Pro Tyr Lys Cys

395 400 405

Gly Glu Cys Gly Lys Ser Phe Asn Gln Ser Ser His Phe Ile Thr

410 415 420

His Gln Arg Ile His Leu Gly Asp Arg Pro Tyr Arg Cys Pro Glu

425 430 435

Cys Gly Lys Thr Phe Asn Gln Arg Ser His Phe Leu Thr His Gln

440 445 450

Arg Thr His Thr Gly Glu Lys Pro Phe His Cys Ser Lys Cys Asn

455 460 465

Lys Ser Phe Arg Gln Lys Ala His Leu Leu Cys His Gln Asn Thr

470 475 480

His Leu Ile


<210> 44

<211> 137

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7506379CD1

<400> 44

Met Lys Ala Ser Gly Thr Leu Arg Glu Tyr Lys Leu Lys Lys Met

1 5 10 15

Lys Lys Ser Ser Gly Glu Ile Val Tyr Cys Gly Gln Val Phe Glu

20 25 30

Lys Ser Pro Leu Arg Val Lys Asn Phe Gly Ile Trp Leu Arg Tyr

35 40 45

Asp Ser Arg Ser Gly Thr His Asn Met Tyr Arg Glu Tyr Arg Asp

50 55 60

Leu Thr Thr Ala Gly Ala Val Thr Gln Cys Tyr Arg Asp Met Gly

65 70 75

Ala Arg His Arg Ala Arg Ala His Ser Ile Gln Ile Met Lys Val

80 85 90

Glu Glu Ile Ala Ala Ser Lys Cys Arg Arg Pro Ala Val Lys Gln

95 100 105

Phe His Asp Ser Lys Ile Lys Phe Pro Leu Pro His Arg Val Leu

110 115 120

Arg Arg Gln His Lys Pro Arg Phe Thr Thr Lys Arg Pro Asn Thr

125 130 135

Phe Phe



<210> 45 

<211> 200 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506253CD1 



<400> 45 

Met Asn Asn Lys Phe Asp Ala Leu Lys Asp Asp Asp Ser Gly Asp

1 5 10 15

His Asp Gln Asn Glu Glu Asn Ser Thr Gln Lys Asp Gly Glu Lys

20 25 30

Glu Lys Thr Glu Arg Asp Lys Asn Gln Ser Ser Ser Lys Arg Lys

35 40 45

Val Glu Gln Phe Trp Arg Phe Tyr Ser His Met Val Arg Pro Gly

50 55 60

Asp Leu Thr Gly His Ser Asp Phe His Leu Phe Lys Glu Gly Ile

65 70 75

Lys Pro Met Trp Glu Asp Asp Ala Asn Lys Asn Gly Gly Lys Trp

80 85 90

Ile Ile Arg Leu Arg Lys Gly Leu Ala Ser Arg Cys Trp Glu Asn

95 100 105

Leu Ile Leu Ala Met Leu Gly Glu Gln Phe Met Val Gly Glu Glu

110 115 120

Ile Cys Gly Ala Val Val Ser Val Arg Phe Gln Glu Asp Ile Ile

125 130 135

Ser Ile Trp Asn Lys Thr Ala Ser Asp Gln Ala Thr Thr Ala Arg

140 145 150

Ile Arg Asp Thr Leu Arg Arg Val Leu Asn Leu Pro Pro Asn Thr

155 160 165

Ile Met Glu Tyr Lys Thr His Thr Asp Ser Ile Lys Met Pro Gly

170 175 180

Arg Leu Gly Pro Gln Arg Leu Leu Phe Gln Asn Leu Trp Lys Pro

185 190 195

Arg Leu Asn Val Pro

200



<210> 46 

<211> 123 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506353CD1 

<400> 46 

Met Glu Phe Pro Asp Leu Gly Lys His Cys Ser Glu Lys Thr Cys

1 5 10 15

Lys Gln Leu Asp Phe Leu Pro Val Lys Cys Asp Ala Cys Lys Gln

20 25 30

Asp Phe Cys Lys Asp His Phe Pro Tyr Ala Ala His Lys Cys Pro

35 40 45

Phe Ala Phe Gln Lys Asp Val His Val Pro Val Cys Pro Leu Cys

50 55 60

Asn Thr Pro Ile Pro Val Lys Lys Gly Gln Ile Pro Asp Val Val

65 70 75

Val Gly Asp His Ile Asp Arg Asp Cys Asp Ser His Pro Gly Lys

80 85 90

Lys Lys Glu Lys His Arg His Pro Leu Asp His Ser Cys Arg His

95 100 105

Gly Ser Arg Pro Thr Ile Lys Ala Gly Val Ser Ser Ser Val Val

110 115 120

Gly Ser Gly


<210> 47

<211> 874

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7506372CD1

<400> 47

Met Tyr Gly Ala Ser Gly Gly Arg Ala Lys Pro Glu Arg Lys Ser

1 5 10 15

Gly Ala Lys Glu Glu Ala Gly Pro Gly Gly Ala Gly Gly Gly Gly

20 25 30

Ser Arg Val Glu Leu Leu Val Phe Gly Tyr Ala Cys Lys Leu Phe

35 40 45

Arg Asp Asp Glu Arg Ala Leu Ala Gln Glu Gln Gly Gln His Leu

50 55 60

Ile Pro Trp Met Gly Asp His Lys Ile Leu Ile Asp Arg Tyr Asp

65 70 75

Gly Arg Gly His Leu His Asp Leu Ser Glu Tyr Asp Ala Glu Tyr

80 85 90

Ser Thr Trp Asn Arg Asp Tyr Gln Leu Ser Glu Glu Glu Ala Arg

95 100 105

Ile Glu Ala Leu Cys Asp Glu Glu Arg Tyr Leu Ala Leu His Thr

110 115 120

Asp Leu Leu Glu Glu Glu Ala Arg Gln Glu Glu Glu Tyr Lys Arg

125 130 135

Leu Ser Glu Ala Leu Ala Glu Asp Gly Ser Tyr Asn Ala Val Gly

140 145 150

Phe Thr Tyr Gly Ser Asp Tyr Tyr Asp Pro Ser Glu Pro Thr Glu

155 160 165

Glu Glu Glu Pro Ser Lys Gln Arg Glu Lys Asn Glu Ala Glu Asn

170 175 180

Leu Glu Glu Asn Glu Glu Pro Phe Val Ala Pro Leu Gly Leu Ser

185 190 195

Val Pro Ser Asp Val Glu Leu Pro Pro Thr Ala Lys Met His Ala

200 205 210

Ile Ile Glu Arg Thr Ala Ser Phe Val Cys Arg Gln Gly Ala Gln

215 220 225

Phe Glu Ile Met Leu Lys Ala Lys Gln Ala Arg Asn Ser Gln Phe

230 235 240

Asp Phe Leu Arg Phe Asp His Tyr Leu Asn Pro Tyr Tyr Lys Phe

245 250 255

Ile Gln Lys Ala Met Lys Glu Gly Arg Tyr Thr Val Leu Ala Glu

260 265 270

Asn Lys Ser Asp Glu Lys Lys Lys Ser Gly Val Ser Ser Asp Asn 

275 280 285 

Glu Asp Asp Asp Asp Glu Glu Asp Gly Asn Tyr Leu His Pro Ser 

290 295 300 

Leu Phe Ala Ser Lys Lys Cys Asn Arg Leu Glu Glu Leu Met Lys 

305 310 315 

Pro Leu Lys Val Val Asp Pro Asp His Pro Leu Ala Ala Leu Val 

320 325 330 

Arg Lys Ala Gln Ala Asp Ser Ser Thr Pro Thr Pro His Asn Ala

335 340 345

Asp Gly Ala Pro Val Gln Pro Ser Gln Val Glu Tyr Thr Ala Asp

350 355 360

Ser Thr Val Ala Ala Met Tyr Tyr Ser Tyr Tyr Met Leu Pro Asp

365 370 375

Gly Thr Tyr Cys Leu Ala Pro Pro Pro Pro Gly Ile Asp Val Thr

380 385 390

Thr Tyr Tyr Ser Thr Leu Pro Ala Gly Val Thr Val Ser Asn Ser

395 400 405

Pro Gly Val Thr Thr Thr Ala Pro Pro Pro Pro Gly Thr Thr Pro

410 415 420

Pro Pro Pro Pro Thr Thr Ala Glu Thr Ser Ser Gly Ala Thr Ser

425 430 435

Thr Thr Thr Thr Thr Ser Ala Leu Ala Pro Val Ala Ala Ile Ile

440 445 450

Pro Pro Pro Pro Asp Val Gln Pro Val Ile Asp Lys Leu Ala Glu

455 460 465

Tyr Val Ala Arg Asn Gly Leu Lys Phe Glu Thr Ser Val Arg Ala

470 475 480

Lys Asn Asp Gln Arg Phe Glu Phe Leu Gln Pro Trp His Gln Tyr

485 490 495

Asn Ala Tyr Tyr Glu Phe Lys Lys Gln Phe Phe Leu Gln Lys Glu

500 505 510

Gly Gly Asp Ser Met Gln Ala Val Ser Ala Pro Glu Glu Ala Pro

515 520 525

Thr Asp Ser Ala Pro Glu Lys Pro Ser Asp Ala Gly Glu Asp Gly

530 535 540

Ala Pro Glu Asp Ala Ala Glu Val Gly Ala Arg Ala Gly Ser Gly

545 550 555

Gly Lys Lys Glu Ala Ser Ser Ser Lys Thr Val Pro Asp Gly Lys

560 565 570

Leu Val Lys Ala Lys Gln Lys Leu Glu Asp Arg Leu Ala Ala Ala

575 580 585

Ala Arg Glu Lys Leu Ala Gln Ala Ser Lys Glu Ser Lys Glu Lys

590 595 600

Gln Leu Gln Ala Glu Arg Lys Arg Lys Ala Ala Leu Phe Leu Gln

605 610 615

Thr Leu Lys Asn Pro Leu Pro Glu Ala Glu Ala Gly Lys Ile Glu

620 625 630

Glu Ser Pro Phe Ser Val Glu Glu Ser Ser Thr Thr Pro Cys Pro

635 640 645

Leu Leu Thr Gly Gly Arg Pro Leu Pro Thr Leu Glu Val Lys Pro

650 655 660

Pro Asp Arg Pro Ser Ser Lys Ser Lys Asp Pro Pro Arg Glu Glu 

665 670 675 

Glu Lys Glu Lys Lys Lys Lys Lys His Lys Lys Arg Ser Arg Thr 

680 685 690 

Arg Ser Arg Ser Pro Lys Tyr His Ser Ser Ser Lys Ser Arg Ser 

695 700 705 

Arg Ser His Ser Lys Ala Lys His Ser Leu Pro Ser Ala Tyr Arg 

710 715 720 

Thr Val Arg Arg Ser Arg Ser Arg Ser Arg Ser Pro Arg Arg Arg 

725 730 735 

Ala His Ser Pro Glu Arg Arg Arg Glu Glu Arg Ser Val Pro Thr 

740 745 750 

Ala Tyr Arg Val Ser Arg Ser Pro Gly Ala Ser Arg Lys Arg Thr 

755 760 765 

Arg Ser Arg Ser Pro His Glu Lys Lys Lys Lys Arg Arg Ser Arg 

770 775 780 

Ser Arg Thr Lys Ser Lys Ala Arg Ser Gln Ser Val Ser Pro Ser

785 790 795

Lys Gln Ala Ala Pro Arg Pro Ala Ala Pro Ala Ala His Ser Ala

800 805 810

His Ser Ala Ser Val Ser Pro Val Glu Ser Arg Gly Ser Ser Gln

815 820 825

Glu Arg Ser Arg Gly Val Ser Gln Glu Lys Glu Ala Gln Ile Ser

830 835 840

Ser Ala Ile Val Ser Ser Val Gln Ser Lys Ile Thr Gln Asp Leu

845 850 855

Met Ala Lys Val Arg Ala Met Leu Ala Ala Ser Lys Asn Leu Gln

860 865 870

Thr Ser Ala Ser


<210> 48
<211> 189
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7506335CD1

<400> 48

Met Ala Ala Asp Thr Pro Gly Lys Pro Ser Ala Ser Pro Met Ala

1 5 10 15

Gly Ala Pro Ala Ser Ala Ser Arg Thr Pro Asp Lys Pro Arg Ser 

20 25 30 

Ala Ala Glu His Arg Lys Ser Ser Arg His Ser Lys Leu Glu Lys 

35 40 45 

Ala Asp Ile Leu Glu Met Thr Val Arg His Leu Arg Ser Leu Arg

50 55 60

Arg Val Gln Val Thr Ala Ala Leu Ser Ala Asp Pro Ala Val Leu

65 70 75 

Gly Lys Tyr Arg Ala Gly Phe His Glu Cys Leu Ala Glu Val Asn 

80 85 90 

Arg Phe Leu Ala Gly Cys Glu Gly Val Pro Ala Asp Val Arg Ser

95 100 105

Arg Leu Leu Gly His Leu Ala Ala Cys Leu Arg Gln Leu Gly Pro

110 115 120

Ser Arg Arg Pro Ala Ser Leu Ser Pro Ala Ala Pro Ala Glu Ala 

125 130 135 

Pro Ala Pro Glu Val Tyr Ala Gly Arg Pro Leu Leu Pro Ser Leu 

140 145 150 

Gly Gly Pro Phe Pro Leu Leu Ala Pro Pro Leu Leu Pro Gly Leu 

155 160 165 

Thr Arg Ala Leu Pro Ala Ala Pro Arg Ala Gly Pro Gln Gly Pro

170 175 180

Gly Gly Pro Trp Arg Pro Trp Leu Arg

185

<210> 49
<211> 716
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 5546982CD1

<400> 49

Met Glu Gly Asp Gly Ser Asp Ser Pro Val Thr Ile Lys Asn Ile

1 5 10 15

Glu Arg Glu Leu Ile Cys Pro Ala Cys Lys Glu Leu Phe Thr His

20 25 30

Pro Leu Ile Leu Pro Cys Gln His Ser Ile Cys His Lys Cys Val

35 40 45

Lys Glu Leu Leu Leu Thr Leu Asp Asp Ser Phe Asn Asp Val Gly

50 55 60

Ser Asp Asn Ser Asn Gln Ser Ser Pro Arg Leu Arg Leu Pro Ser

65 70 75

Pro Ser Met Asp Lys Ile Asp Arg Ile Asn Arg Pro Gly Trp Lys

80 85 90

Arg Asn Ser Leu Thr Pro Arg Thr Thr Val Phe Pro Cys Pro Gly

95 100 105

Cys Glu His Asp Val Asp Leu Gly Glu Arg Gly Ile Asn Gly Leu

110 115 120

Phe Arg Asn Phe Thr Leu Glu Thr Ile Val Glu Arg Tyr Arg Gln

125 130 135

Ala Ala Arg Ala Ala Thr Ala Ile Met Cys Asp Leu Cys Lys Pro

140 145 150

Pro Pro Gln Glu Ser Thr Lys Ser Cys Met Asp Cys Ser Ala Ser

155 160 165

Tyr Cys Asn Glu Cys Phe Lys Ile His His Pro Trp Gly Thr Ile

170 175 180

Lys Ala Gln His Glu Tyr Val Gly Pro Thr Thr Asn Phe Arg Pro

185 190 195

Lys Ile Leu Met Cys Pro Glu His Glu Thr Glu Arg Ile Asn Met

200 205 210

Tyr Cys Glu Leu Cys Arg Arg Pro Val Cys His Leu Cys Lys Leu

215 220 225

Gly Gly Asn His Ala Asn His Arg Val Thr Thr Met Ser Ser Ala

230 235 240

Tyr Lys Thr Leu Lys Glu Lys Leu Ser Lys Asp Ile Asp Tyr Leu

245 250 255

Ile Gly Lys Glu Ser Gln Val Lys Ser Gln Ile Ser Glu Leu Asn

260 265 270

Leu Leu Met Lys Glu Thr Glu Cys Asn Gly Glu Arg Ala Lys Glu

275 280 285

Glu Ala Ile Thr His Phe Glu Lys Leu Phe Glu Val Leu Glu Glu

290 295 300

Arg Lys Ser Ser Val Leu Lys Ala Ile Asp Ser Ser Lys Lys Leu

305 310 315

Arg Leu Asp Lys Phe Gln Thr Gln Met Glu Glu Tyr Gln Gly Leu

320 325 330

Leu Glu Asn Asn Gly Leu Val Gly Tyr Ala Gln Glu Val Leu Lys

335 340 345

Glu Thr Asp Gln Ser Cys Phe Val Gln Thr Ala Lys Gln Leu His

350 355 360

Leu Arg Ile Gln Lys Ala Thr Glu Ser Leu Arg Ser Phe Arg Pro

365 370 375

Ala Ala Gln Thr Ser Phe Glu Asp Tyr Val Val Asn Thr Ser Lys

380 385 390

Gln Thr Glu Leu Leu Gly Glu Leu Ser Phe Phe Ser Ser Gly Ile

395 400 405

Asp Val Pro Glu Ile Asn Glu Glu Gln Ser Lys Val Tyr Asn Asn

410 415 420

Ala Leu Ile Asn Trp His His Pro Glu Lys Asp Lys Ala Asp Ser

425 430 435

Tyr Val Leu Glu Tyr Arg Lys Ile Asn Arg Asp Asp Glu Met Ser

440 445 450

Trp Asn Glu Ile Glu Val Cys Gly Thr Ser Lys Ile Ile Gln Asp

455 460 465

Leu Glu Asn Ser Ser Thr Tyr Ala Phe Arg Val Arg Ala Tyr Lys

470 475 480

Gly Ser Ile Cys Ser Pro Cys Ser Arg Glu Leu Ile Leu His Thr

485 490 495

Pro Pro Ala Pro Val Phe Ser Phe Leu Phe Asp Glu Lys Cys Gly

500 505 510

Tyr Asn Asn Glu His Leu Leu Leu Asn Leu Lys Arg Asp Arg Val

515 520 525

Glu Ser Arg Ala Gly Phe Asn Leu Leu Leu Ala Ala Glu Arg Ile

530 535 540

Gln Val Gly Tyr Tyr Thr Ser Leu Asp Tyr Ile Ile Gly Asp Thr

545 550 555

Gly Ile Thr Lys Gly Lys His Phe Trp Ala Phe Arg Val Glu Pro

560 565 570

Tyr Ser Tyr Leu Val Lys Val Gly Val Ala Ser Ser Asp Lys Leu

575 580 585

Gln Glu Trp Leu Arg Ser Pro Arg Asp Ala Val Ser Pro Arg Tyr

590 595 600

Glu Gln Asp Ser Gly His Asp Ser Gly Ser Glu Asp Ala Cys Phe

605 610 615

Asp Ser Ser Gln Pro Phe Thr Leu Val Thr Ile Gly Met Gln Lys

620 625 630

Phe Phe Ile Pro Lys Ser Pro Thr Ser Ser Asn Glu Pro Glu Asn

635 640 645

Arg Val Leu Pro Met Pro Thr Ser Ile Gly Ile Phe Leu Asp Cys

650 655 660

Asp Lys Gly Lys Val Asn Phe Tyr Asp Met Asp Gln Met Lys Cys

665 670 675

Leu Tyr Glu Arg Gln Val Asp Cys Ser His Thr Leu Tyr Pro Ala

680 685 690

Phe Ala Leu Met Gly Ser Gly Gly Ile Gln Leu Glu Glu Pro Ile

695 700 705

Thr Ala Lys Tyr Leu Glu Tyr Gln Glu Asp Met

710 715


<210> 50
<211> 503
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7507432CD1

<400> 50

Met Ser Glu Ala Arg Gly Glu Pro Gly Ser Gly Pro Glu Ala Gly

1 5 10 15

Ala Arg Phe Phe Cys Thr Ala Gly Arg Gly Leu Glu Pro Phe Val

20 25 30

Met Arg Glu Val Arg Ala Arg Leu Ala Ala Thr Gln Val Glu Tyr

35 40 45

Ile Ser Gly Lys Val Phe Phe Thr Thr Cys Ser Asp Leu Asn Met

50 55 60

Leu Lys Lys Leu Lys Ser Ala Glu Arg Leu Phe Leu Leu Ile Lys

65 70 75

Lys Gln Phe Pro Leu Ile Ile Ser Ser Val Ser Lys Gly Lys Ile

80 85 90

Phe Asn Glu Met Gln Arg Leu Ile Asn Glu Asp Pro Gly Ser Trp

95 100 105

Leu Asn Ala Ile Ser Ile Trp Lys Asn Leu Leu Glu Leu Asp Ala

110 115 120

Lys Lys Glu Lys Leu Ser Gln Arg Asp Asp Asn Gln Leu Lys Arg

125 130 135

Lys Val Gly Glu Asn Glu Ile Ile Ala Lys Lys Leu Lys Ile Glu

140 145 150

Gln Met Gln Lys Ile Glu Glu Asn Arg Asp Cys Gln Leu Glu Lys

155 160 165

Gln Ile Lys Glu Glu Thr Leu Glu Gln Arg Asp Phe Thr Thr Lys

170 175 180

Ser Glu Lys Phe Gln Glu Glu Glu Phe Gln Asn Asp Ile Glu Lys

185 190 195

Ala Ile Asp Thr His Asn Gln Asn Asp Leu Thr Phe Arg Val Ser

200 205 210

Cys Arg Cys Ser Gly Thr Ile Gly Lys Ala Phe Thr Ala Gln Glu

215 220 225

Val Gly Lys Val Ile Gly Ile Ala Ile Met Lys His Phe Gly Trp

230 235 240

Lys Ala Asp Leu Arg Asn Pro Gln Leu Glu Ile Phe Ile His Leu

245 250 255

Asn Asp Ile Tyr Ser Val Val Gly Ile Pro Val Phe Arg Val Ser

260 265 270

Leu Ala Ser Arg Ala Tyr Ile Lys Thr Ala Gly Leu Arg Ser Thr

275 280 285

Ile Ala Trp Ala Met Ala Ser Leu Ala Asp Ile Lys Ala Gly Ala

290 295 300

Phe Val Leu Asp Pro Met Cys Gly Leu Gly Thr Ile Leu Leu Glu

305 310 315

Ala Ala Lys Glu Trp Pro Asp Val Tyr Tyr Val Gly Ala Asp Val

320 325 330

Ser Asp Ser Gln Leu Leu Gly Thr Trp Asp Asn Leu Lys Ala Ala

335 340 345

Gly Leu Glu Asp Lys Ile Glu Leu Leu Lys Ile Ser Val Ile Glu

350 355 360

Leu Pro Leu Pro Ser Glu Ser Val Asp Ile Ile Ile Ser Asp Ile

365 370 375

Pro Phe Gly Lys Lys Phe Lys Leu Gly Lys Asp Ile Lys Ser Ile

380 385 390

Leu Gln Glu Met Glu Arg Val Leu His Val Gly Gly Thr Ile Val

395 400 405

Leu Leu Leu Ser Glu Asp His His Arg Arg Leu Thr Asp Cys Lys

410 415 420

Glu Ser Asn Ile Pro Phe Asn Ser Lys Asp Ser His Thr Asp Glu

425 430 435

Pro Gly Ile Lys Lys Cys Leu Asn Pro Glu Glu Lys Thr Gly Ala

440 445 450

Phe Lys Thr Ala Ser Thr Ser Phe Glu Ala Ser Asn His Lys Phe

455 460 465

Leu Asp Arg Met Ser Pro Phe Gly Ser Leu Val Pro Val Glu Cys

470 475 480

Tyr Lys Val Ser Leu Gly Lys Thr Asp Ala Phe Ile Cys Lys Tyr

485 490 495

Lys Lys Ser His Ser Ser Gly Leu 

500 

<210> 51 

<211> 410 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5639578CD1 

<400> 51 

Met Ser Leu Pro Glu Ser Pro His Ser Pro Ala Thr Leu Asp Tyr 

1 5 10 15

Ala Leu Glu Asp Pro His Gln Gly Gln Arg Ser Arg Glu Lys Ser

20 25 30

Lys Ala Thr Glu Val Met Ala Asp Met Phe Asp Gly Arg Leu Glu

35 40 45

Pro Ile Val Phe Pro Pro Pro Arg Leu Pro Glu Glu Gly Val Ala

50 55 60

Pro Gln Asp Pro Ala Asp Gly Gly His Thr Phe His Ile Leu Val

65 70 75

Asp Ala Gly Arg Ser His Gly Ala Ile Lys Ala Gly Gln Glu Val

80 85 90

Thr Pro Pro Pro Ala Glu Gly Leu Glu Ala Ala Ser Ala Ser Leu

95 100 105

Thr Thr Asp Gly Ser Leu Lys Asn Gly Phe Pro Gly Glu Glu Thr

110 115 120

His Gly Leu Gly Gly Glu Lys Ala Leu Glu Thr Cys Gly Ala Gly

125 130 135

Arg Ser Glu Ser Glu Val Ile Ala Glu Gly Lys Ala Glu Asp Val

140 145 150

Lys Pro Glu Glu Cys Ala Met Phe Ser Ala Pro Val Asp Glu Lys

155 160 165

Pro Gly Gly Glu Glu Met Asp Val Ala Glu Glu Asn Arg Ala Ile

170 175 180

Asp Glu Val Asn Arg Glu Ala Gly Pro Gly Pro Gly Pro Gly Pro

185 190 195

Leu Asn Val Gly Leu His Leu Asn Pro Leu Glu Ser Ile Gln Leu

200 205 210

Glu Leu Asp Ser Val Asn Ala Glu Ala Asp Arg Ala Leu Leu Gln

215 220 225

Val Glu Arg Arg Phe Gly Gln Ile His Glu Tyr Tyr Leu Glu Gln

230 235 240

Arg Asn Asp Ile Ile Arg Asn Ile Pro Gly Phe Trp Val Thr Ala

245 250 255

Phe Arg His His Pro Gln Leu Ser Ala Met Ile Arg Gly Gln Asp

260 265 270

Ala Glu Met Leu Ser Tyr Leu Thr Asn Leu Glu Val Lys Glu Leu

275 280 285

Arg His Pro Arg Thr Gly Cys Lys Phe Lys Phe Phe Phe Gln Arg

290 295 300

Asn Pro Tyr Phe Arg Asn Lys Leu Ile Val Lys Val Tyr Glu Val

305 310 315

Arg Ser Phe Gly Gln Val Val Ser Phe Ser Thr Leu Ile Met Trp

320 325 330

Arg Arg Gly His Gly Pro Gln Ser Phe Ile His Arg Asn Arg His

335 340 345

Val Ile Cys Ser Phe Phe Thr Trp Phe Ser Asp His Ser Leu Pro

350 355 360

Glu Ser Asp Arg Ile Ala Gln Ile Ile Lys Glu Asp Leu Trp Ser

365 370 375

Asn Pro Leu Gln Tyr Tyr Leu Leu Gly Glu Asp Ala His Arg Ala

380 385 390

Arg Arg Arg Leu Val Arg Glu Pro Val Glu Ile Pro Arg Pro Phe

395 400 405 

Gly Phe Gln Cys Gly

410

<210> 52

<211> 1056

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7509080CD1

<400> 52

Met Glu Pro Gly Cys Asp Glu Phe Leu Pro Pro Pro Glu Cys Pro

1 5 10 15

Val Phe Glu Pro Ser Trp Ala Glu Phe Gln Asp Pro Leu Gly Tyr

20 25 30

Ile Ala Lys Ile Arg Pro Ile Ala Glu Lys Ser Gly Ile Cys Lys
35 40 45

Ile Arg Pro Pro Ala Asp Trp Gln Pro Pro Phe Ala Val Glu Val
50 55 60

Asp Asn Phe Arg Phe Thr Pro Arg Val Gln Arg Leu Asn Glu Leu
65 70 75

Glu Ala Gln Thr Arg Val Lys Leu Asn Tyr Leu Asp Gln Ile Ala
80 85 90

Lys Phe Trp Glu Ile Gln Gly Ser Ser Leu Lys Ile Pro Asn Val
95 100 105

Glu Arg Lys Ile Leu Asp Leu Tyr Ser Leu Ser Lys Gln Cys Asn
110 115 120

Thr His Pro Phe Asp Asn Glu Val Lys Asp Lys Glu Tyr Lys Pro
125 130 135

His Ser Ile Pro Leu Arg Gln Ser Val Gln Pro Ser Lys Phe Ser
140 145 150

Ser Tyr Ser Arg Arg Ala Lys Arg Leu Gln Pro Asp Pro Glu Pro
155 160 165

Thr Glu Glu Asp Ile Glu Lys His Pro Glu Leu Lys Lys Leu Gln
170 175 180

Ile Tyr Gly Pro Gly Pro Lys Met Met Gly Leu Gly Leu Met Ala
185 190 195

Lys Asp Lys Asp Lys Thr Val His Lys Lys Val Thr Cys Pro Pro
200 205 210

Thr Val Thr Val Lys Asp Glu Gln Ser Gly Gly Gly Asn Val Ser
215 220 225

Ser Thr Leu Leu Lys Gln His Leu Ser Leu Glu Pro Cys Thr Lys
230 235 240

Thr Thr Met Gln Leu Arg Lys Asn His Ser Ser Ala Gln Phe Ile
245 250 255

Asp Ser Tyr Ile Cys Gln Val Cys Ser Arg Gly Asp Glu Asp Asp
260 265 270

Lys Leu Leu Phe Cys Asp Gly Cys Asp Asp Asn Tyr His Ile Phe
275 280 285

Cys Leu Leu Pro Pro Leu Pro Glu Ile Pro Arg Gly Ile Trp Arg
290 295 300

Cys Pro Lys Cys Ile Leu Ala Glu Cys Lys Gln Pro Pro Glu Ala
305 310 315

Phe Gly Phe Glu Gln Ala Thr Gln Glu Tyr Ser Leu Gln Ser Phe
320 325 330

Gly Glu Met Ala Asp Ser Phe Lys Ser Asp Tyr Phe Asn Met Pro
335 340 345

Val His Met Val Pro Thr Glu Leu Val Glu Lys Glu Phe Trp Arg
350 355 360

Leu Val Ser Ser Ile Glu Glu Asp Val Thr Val Glu Tyr Gly Ala
365 370 375

Asp Ile His Ser Lys Glu Phe Gly Ser Gly Phe Pro Val Ser Asn
380 385 390

Ser Lys Gln Asn Leu Ser Pro Glu Glu Lys Glu Tyr Ala Thr Ser
395 400 405

Gly Trp Asn Leu Asn Val Met Pro Val Leu Asp Gln Ser Val Leu
410 415 420

Cys His Ile Asn Ala Asp Ile Ser Gly Met Lys Val Pro Trp Leu
425 430 435

Tyr Val Gly Met Val Phe Ser Ala Phe Cys Trp His Ile Glu Asp
440 445 450

His Trp Ser Tyr Ser Ile Asn Tyr Leu His Trp Gly Glu Pro Lys
455 460 465

Thr Trp Tyr Gly Val Pro Ser Leu Ala Ala Glu His Leu Glu Glu
470 475 480

Val Met Lys Met Leu Thr Pro Glu Leu Phe Asp Ser Gln Pro Asp
485 490 495

Leu Leu His Gln Leu Val Thr Leu Met Asn Pro Asn Thr Leu Met
500 505 510

Ser His Gly Val Pro Val Val Arg Thr Asn Gln Cys Ala Gly Glu
515 520 525

Phe Val Ile Thr Phe Pro Arg Ala Tyr His Ser Gly Phe Asn Gln
530 535 540

Gly Tyr Asn Phe Ala Glu Ala Val Asn Phe Cys Thr Ala Asp Trp
545 550 555

Leu Pro Ala Gly Arg Gln Cys Ile Glu His Tyr Arg Arg Leu Arg
560 565 570

Arg Tyr Cys Val Phe Ser His Glu Glu Leu Ile Cys Lys Met Ala
575 580 585

Ala Phe Pro Glu Thr Leu Asp Leu Asn Leu Ala Val Ala Val His
590 595 600

Lys Glu Met Phe Ile Met Val Gln Glu Glu Arg Arg Leu Arg Lys
605 610 615

Ala Leu Leu Glu Lys Gly Val Thr Glu Ala Glu Arg Glu Ala Phe
620 625 630

Glu Leu Leu Pro Asp Asp Glu Arg Gln Cys Ile Lys Cys Lys Thr
635 640 645

Thr Cys Phe Leu Ser Ala Leu Ala Cys Tyr Asp Cys Pro Asp Gly
650 655 660

Leu Val Cys Leu Ser His Ile Asn Asp Leu Cys Lys Cys Ser Ser
665 670 675

Ser Arg Gln Tyr Leu Arg Tyr Arg Tyr Thr Leu Asp Glu Leu Pro
680 685 690

Thr Met Leu His Lys Leu Lys Ile Arg Ala Glu Ser Phe Asp Thr
695 700 705

Trp Ala Asn Lys Val Arg Val Ala Leu Glu Val Glu Asp Gly Arg
710 715 720

Lys Arg Ser Phe Glu Glu Leu Arg Ala Leu Glu Ser Glu Ala Arg
725 730 735

Glu Arg Arg Phe Pro Asn Ser Glu Leu Leu Gln Arg Leu Lys Asn
740 745 750

Cys Leu Ser Glu Val Glu Ala Cys Ile Ala Gln Val Leu Gly Leu
755 760 765

Val Ser Gly Gln Val Ala Arg Met Asp Thr Pro Gln Leu Thr Leu
770 775 780

Thr Glu Leu Arg Val Leu Leu Glu Gln Met Gly Ser Leu Pro Cys
785 790 795

Ala Met His Gln Ile Gly Asp Val Lys Asp Val Leu Glu Gln Val
800 805 810

Glu Ala Tyr Gln Ala Glu Ala Arg Glu Ala Leu Ala Thr Leu Pro
815 820 825

Ser Ser Pro Gly Leu Leu Arg Ser Leu Leu Glu Arg Gly Gln Gln
830 835 840

Leu Gly Val Glu Val Pro Glu Ala His Gln Leu Gln Gln Gln Val
845 850 855

Glu Gln Ala Gln Trp Leu Asp Glu Val Lys Gln Ala Leu Ala Pro
860 865 870

Ser Ala His Arg Gly Ser Leu Val Ile Met Gln Gly Leu Leu Val

875 880 885

Met Gly Ala Lys Ile Ala Ser Ser Pro Ser Val Asp Lys Ala Arg

890 895 900

Ala Glu Leu Gln Glu Leu Leu Thr Ile Ala Glu Arg Trp Glu Glu

905 910 915

Lys Ala His Phe Cys Leu Glu Ala Arg Gln Lys His Pro Pro Ala

920 925 930

Thr Leu Glu Ala Ile Ile Arg Glu Thr Glu Asn Ile Pro Val His

935 940 945

Leu Pro Asn Ile Gln Ala Leu Lys Glu Ala Leu Thr Lys Ala Gln

950 955 960

Ala Trp Ile Ala Asp Val Asp Glu Ile Gln Asn Gly Asp His Tyr

965 970 975

Pro Cys Leu Asp Asp Leu Glu Gly Leu Val Ala Val Gly Arg Asp

980 985 990

Leu Pro Val Gly Leu Glu Glu Leu Arg Gln Leu Glu Leu Gln Val

995 1000 1005

Leu Thr Ala His Ser Trp Arg Glu Lys Ala Ser Lys Thr Phe Leu

1010 1015 1020

Lys Lys Asn Ser Cys Tyr Thr Leu Leu Glu Val Arg Ser Glu Thr

1025 1030 1035

Leu Thr His Ser Leu Phe Phe Ile Trp Pro Gly Cys Cys Glu Met

1040 1045 1050
Ala His Ile Met Arg Thr
1055

<210> 53 

<211> 103 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505899CD1 

<400> 53 

Met Arg Tyr Val Ala Ser Tyr Leu Leu Ala Ala Leu Gly Gly Asn 
1 5 10 15

Ser Ser Pro Ser Ala Lys Asp Ile Lys Lys Ile Leu Asp Ser Val

20 25 30

Ile Ser Glu Leu Asn Gly Lys Asn Ile Glu Asp Val Ile Ala Gln

35 40 45

Gly Ile Gly Lys Leu Ala Ser Val Pro Ala Gly Gly Ala Val Ala

50 55 60 

Val Ser Ala Ala Pro Gly Ser Ala Ala Pro Ala Ala Gly Ser Ala 

65 70 75 

Pro Ala Ala Ala Glu Glu Lys Lys Asp Glu Lys Lys Glu Glu Ser 

80 85 90 

Glu Glu Ser Asp Asp Asp Met Gly Phe Gly Leu Phe Asp 

95 100 

<210> 54 

<211> 232 

<212> PRT 

<213> Homo sapiens 
<220>

<221> misc_feature

<223> Incyte ID No: 7505904CD1 



<400> 54 

Met Glu Tyr Pro Ala Pro Ala Thr Val Gln Ala Ala Asp Gly Gly
1 5 10 15

Ala Ala Gly Pro Tyr Ser Ser Ser Glu Leu Leu Glu Gly Gln Glu
20 25 30

Pro Asp Gly Val Arg Phe Asp Arg Glu Arg Ala Arg Arg Leu Trp
35 40 45

Glu Ala Val Ser Gly Ala Gln Pro Val Gly Arg Glu Glu Val Glu
50 55 60

His Met Ile Gln Lys Asn Gln Cys Leu Phe Thr Asn Thr Gln Cys
65 70 75

Lys Val Cys Cys Ala Leu Leu Ile Ser Glu Ser Gln Lys Leu Ala
80 85 90

His Tyr Gln Ser Lys Lys His Ala Asn Lys Val Lys Arg Tyr Leu
95 100 105

Ala Ile His Gly Met Glu Thr Leu Lys Gly Glu Thr Lys Lys Leu
110 115 120

Asp Ser Asp Gln Lys Ser Ser Arg Ser Lys Asp Lys Asn Gln Cys
125 130 135

Cys Pro Ile Cys Asn Met Thr Phe Ser Ser Pro Val Val Ala Gln
140 145 150

Ser His Tyr Leu Gly Lys Thr His Ala Lys Asn Leu Lys Leu Lys
155 160 165

Gln Gln Ser Thr Lys Val Glu Ala Gly Lys Gly Tyr Pro Cys Lys
170 175 180

Thr Cys Lys Ile Val Leu Asn Ser Ile Glu Gln Tyr Gln Ala His
185 190 195

Val Ser Gly Phe Lys His Lys Asn Gln Ser Pro Lys Thr Val Ala
200 205 210

Ser Ser Leu Gly Gln Ile Pro Met Gln Arg Gln Pro Ile Gln Lys
215 220 225 

Asp Ser Thr Thr Leu Glu Asp 
230 



<210> 55 
<211> 508 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7509224CD1 



<400> 55 

Met Thr Ser Leu Met Pro Gly Ala Gly Leu Leu Pro Ile Pro Thr
1 5 10 15

Pro Asn Pro Leu Thr Thr Leu Gly Val Ser Leu Ser Ser Leu Gly

20 25 30

Ala Ile Pro Ala Ala Ala Leu Asp Pro Asn Ile Ala Thr Leu Gly

35 40 45

Glu Ile Pro Gln Pro Pro Leu Met Gly Asn Val Asp Pro Ser Lys

50 55 60 
Ile Asp Glu Ile Arg Arg Thr Val Tyr Val Gly Asn Leu Asn Ser
65 70 75

Gln Thr Thr Thr Ala Asp Gln Leu Leu Glu Phe Phe Lys Gln Val
80 85 90

Gly Glu Val Lys Phe Val Arg Met Ala Gly Asp Glu Thr Gln Pro
95 100 105

Thr Arg Phe Ala Phe Val Glu Phe Ala Asp Gln Asn Ser Val Pro
110 115 120

Arg Ala Leu Ala Phe Asn Gly Val Met Phe Gly Asp Arg Pro Leu
125 130 135

Lys Ile Asn His Ser Asn Asn Ala Ile Val Lys Pro Pro Glu Met
140 145 150

Thr Pro Gln Ala Ala Ala Lys Glu Leu Glu Glu Val Met Lys Arg
155 160 165

Val Arg Glu Ala Gln Ser Phe Ile Ser Ala Ala Ile Glu Pro Glu
170 175 180

Ser Gly Lys Ser Asn Glu Arg Lys Gly Gly Arg Ser Arg Ser His
185 190 195

Thr Arg Ser Lys Ser Arg Ser Ser Ser Lys Ser His Ser Arg Arg
200 205 210

Lys Arg Ser Gln Ser Lys His Arg Ser Arg Ser His Asn Arg Ser
215 220 225

Arg Ser Arg Gln Lys Asp Arg Arg Arg Ser Lys Ser Pro His Lys
230 235 240

Lys Arg Ser Lys Ser Arg Glu Arg Arg Lys Ser Arg Ser Arg Ser
245 250 255

His Ser Arg Asp Lys Arg Lys Asp Thr Arg Glu Lys Ile Lys Glu
260 265 270

Lys Glu Arg Val Lys Glu Lys Asp Arg Glu Lys Glu Arg Glu Arg
275 280 285

Glu Lys Glu Arg Glu Lys Glu Lys Glu Arg Gly Lys Asn Lys Asp
290 295 300

Arg Asp Lys Glu Arg Glu Lys Asp Arg Glu Lys Asp Lys Glu Lys
305 310 315

Asp Arg Glu Arg Glu Arg Glu Lys Glu His Glu Lys Asp Arg Asp
320 325 330

Lys Glu Lys Glu Lys Glu Gln Asp Lys Glu Lys Glu Arg Glu Lys
335 340 345

Asp Arg Ser Lys Glu Ile Asp Glu Lys Arg Lys Lys Asp Lys Lys
350 355 360

Ser Arg Thr Pro Pro Arg Ser Tyr Asn Ala Ser Arg Arg Ser Arg
365 370 375

Ser Ser Ser Arg Glu Arg Arg Arg Arg Arg Ser Arg Ser Ser Ser
380 385 390

Arg Ser Pro Arg Thr Ser Lys Thr Ile Lys Arg Lys Ser Ser Arg
395 400 405

Ser Pro Ser Pro Arg Ser Arg Asn Lys Lys Asp Lys Lys Arg Glu
410 415 420

Lys Glu Arg Asp His Ile Ser Glu Arg Arg Glu Arg Glu Arg Ser
425 430 435

Thr Ser Met Arg Lys Ser Ser Asn Asp Arg Asp Gly Lys Glu Lys
440 445 450

Leu Glu Lys Asn Ser Thr Ser Leu Lys Glu Lys Glu His Asn Lys
455 460 465

Glu Pro Asp Ser Ser Val Ser Lys Glu Val Asp Asp Lys Asp Ala
470 475 480

Pro Arg Thr Glu Glu Asn Lys Ile Gln His Asn Gly Asn Cys Gln
485 490 495

Leu Asn Glu Glu Asn Leu Ser Thr Lys Thr Lys Ala Val
500 505

<210> 56

<211> 274

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7505922CD1

<400> 56

Met Phe Gln Ala Ala Glu Arg Pro Gln Glu Trp Ala Met Glu Gly
1 5 10 15

Pro Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His
20 25 30

Asp Ser Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gln Met
35 40 45

Val Lys Glu Leu Gln Glu Ile Arg Leu Glu Pro Gln Glu Val Pro
50 55 60

Arg Gly Ser Glu Pro Trp Lys Gln Gln Leu Thr Glu Asp Gly Asp
65 70 75

Ser Phe Leu His Leu Ala Ile Ile His Glu Glu Lys Ala Leu Thr
80 85 90

Met Glu Val Ile Arg Gln Val Lys Gly Asp Leu Ala Phe Leu Asn
95 100 105

Phe Gln Asn Asn Leu Gln Gln Thr Pro Leu His Leu Ala Val Ile
110 115 120

Thr Asn Gln Pro Glu Ile Ala Glu Ala Leu Leu Gly Ala Gly Cys
125 130 135

Asp Pro Glu Leu Arg Asp Phe Arg Gly Asn Thr Pro Leu His Leu
140 145 150

Ala Cys Glu Gln Gly Cys Leu Ala Ser Val Gly Val Leu Thr Gln
155 160 165

Ser Cys Thr Thr Pro His Leu His Ser Ile Leu Lys Ala Thr Asn
170 175 180

Tyr Asn Gly His Thr Cys Leu His Leu Ala Ser Ile His Gly Tyr
185 190 195

Leu Gly Ile Val Glu Leu Leu Val Ser Leu Gly Ala Asp Val Asn
200 205 210

Ala Gln Leu Thr Trp Gly Arg Pro Ser Thr Arg Ile Gln Gln Gln
215 220 225

Leu Gly Gln Leu Thr Leu Glu Asn Leu Gln Met Leu Pro Glu Ser
230 235 240

Glu Asp Glu Glu Ser Tyr Asp Thr Glu Ser Glu Phe Thr Glu Phe
245 250 255

Thr Glu Asp Glu Leu Pro Tyr Asp Asp Cys Val Phe Gly Gly Gln
260 265 270

Arg Leu Thr Leu

<210> 57
<211> 527
<212> PRT

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 7507695CD1

<400> 57

Met Thr Ala Glu Leu Arg Glu Ala Met Ala Leu Ala Pro Trp Gly
1 5 10 15

Pro Val Lys Val Lys Lys Glu Glu Glu Glu Glu Glu Asn Phe Pro
20 25 30

Gly Gln Ala Ser Ser Gln Gln Val His Ser Glu Asn Ile Lys Val
35 40 45

Trp Ala Pro Val Gln Gly Leu Gln Thr Gly Leu Asp Gly Ser Glu
50 55 60

Glu Glu Glu Lys Gly Gln Asn Ile Ser Trp Asp Met Ala Val Val
65 70 75

Leu Lys Ala Thr Gln Glu Ala Pro Ala Ala Ser Thr Leu Gly Ser
80 85 90

Tyr Ser Leu Pro Gly Thr Leu Ala Lys Ser Glu Ile Leu Glu Thr
95 100 105

His Gly Thr Met Asn Phe Leu Gly Ala Glu Thr Lys Asn Leu Gln
110 115 120

Leu Leu Val Pro Lys Thr Glu Ile Cys Glu Glu Ala Glu Lys Pro
125 130 135

Leu Ile Ile Ser Glu Arg Ile Gln Lys Ala Asp Pro Gln Gly Pro
140 145 150

Glu Leu Gly Glu Ala Cys Glu Lys Gly Asn Met Leu Lys Arg Gln
155 160 165

Arg Ile Lys Arg Glu Lys Lys Asp Phe Arg Gln Val Ile Val Asn
170 175 180

Asp Cys His Leu Pro Glu Ser Phe Lys Glu Glu Glu Asn Gln Lys
185 190 195

Cys Lys Lys Ser Gly Gly Lys Tyr Ser Leu Asn Ser Gly Ala Val
200 205 210

Lys Asn Pro Lys Thr Gln Leu Gly Gln Lys Pro Phe Thr Cys Ser
215 220 225

Val Cys Gly Lys Gly Phe Ser Gln Ser Ala Asn Leu Val Val His
230 235 240

Gln Arg Ile His Thr Gly Glu Lys Pro Phe Glu Cys His Glu Cys
245 250 255

Gly Lys Ala Phe Ile Gln Ser Ala Asn Leu Val Val His Gln Arg
260 265 270

Ile His Thr Gly Gln Lys Pro Tyr Val Cys Ser Lys Cys Gly Lys
275 280 285

Ala Phe Thr Gln Ser Ser Asn Leu Thr Val His Gln Lys Ile His
290 295 300

Ser Leu Glu Lys Thr Phe Lys Cys Asn Glu Cys Glu Lys Ala Phe
305 310 315

Ser Tyr Ser Ser Gln Leu Ala Arg His Gln Lys Val His Ile Thr
320 325 330

Glu Lys Cys Tyr Glu Cys Asn Glu Cys Gly Lys Thr Phe Thr Arg
335 340 345

Ser Ser Asn Leu Ile Val His Gln Arg Ile His Thr Gly Glu Lys
350 355 360

Pro Phe Ala Cys Asn Asp Cys Gly Lys Ala Phe Thr Gln Ser Ala
365 370 375

Asn Leu Ile Val His Gln Arg Ser His Thr Gly Glu Lys Pro Tyr
380 385 390

Glu Cys Lys Glu Cys Gly Lys Ala Phe Ser Cys Phe Ser His Leu
395 400 405

Ile Val His Gln Arg Ile His Thr Ala Glu Lys Pro Tyr Asp Cys
410 415 420

Ser Glu Cys Gly Lys Ala Phe Ser Gln Leu Ser Cys Leu Ile Val
425 430 435

His Gln Arg Ile His Ser Gly Asp Leu Pro Tyr Val Cys Asn Glu
440 445 450

Cys Gly Lys Ala Phe Thr Cys Ser Ser Tyr Leu Leu Ile His Gln
455 460 465

Arg Ile His Asn Gly Glu Lys Pro Tyr Thr Cys Asn Glu Cys Gly
470 475 480

Lys Ala Phe Arg Gln Arg Ser Ser Leu Thr Val His Gln Arg Thr
485 490 495

His Thr Gly Glu Lys Pro Tyr Glu Cys Glu Lys Cys Gly Ala Ala
500 505 510

Phe Ile Ser Asn Ser His Leu Met Arg Tyr His Arg Thr His Leu
515 520 525

Val Glu

525 



<210> 58 

<211> 2629 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7506140CB1 

<400> 58 

cccttccctc ccccatacct agcccgagtc tgagccctaa cgagaaggct gggcctaggc 60 
cgctggatgc tggagtgaaa ggaagggaga aagggaaaaa gcgggaagag tcgagaaggc 120 
tgagtgttaa gaggccaagt gcgacgcgcg tatccgggca gacggactga cggacgggcc 180 
cgtgcttctg ccgcggctgc ggcgcccgcg cgagtcgcgt ctaagcggcg gcggcggtgg 240 
cagcggcgga aaccgaaggg gagccatggc ggctgacagt cgggaggaga aagatggaga 300 
acttaatgtt ctggatgata ttttaactga agtaccagaa caagatgatg aactgtataa 360 
tccagagagt gaacaagata aaaatgagaa aaagggatca aaaagaaaaa gtgatcgaat 420 
ggaatctact gataccaaac gacaaaagcc ttctgtccat tcaagacaac tggtttctaa 480 
gccactgagc tcatctgtta gcaataacaa aagaatagtt agtacaaaag gaaagtcagc 540 
cacagagtat aaaaatgagg aatatcaaag atctgaaaga aacaagcgtc tagatgctga 600 
tcggaaaatt cgtctatcaa gtagtgcctc cagagaacct tataagaatc aacctgaaaa 660 
aacctgtgtc cggaaaaggg atcctgaaag gagggccaaa tctcctacgc cagatggttc 720 
tgagagaatt gggcttgaag tggatagacg tgcaagcaga tccagccagt cttctaagga 780 
agaagtgaac tctgaagaat atggctctga ccatgagact ggcagcagtg gttcttctga 840 
tgagcaaggg aacaacactg agaatgagga ggaaggagtg gaagaagatg tggaggaaga 900 
tgaagaggag gaggaggagg aggaagaaga agaatatgaa caggatgaga gagaccagaa 960 
agaggaggga aatgattatg acactcgaag tgaggccagt gactctggtt ctgaatctgt 1020 
ttccttcaca gatgggtctg tcagatctgg ttcaggcaca gatggatcag atgagaaaaa 1080 
gaaggaaagg aagagagcta gaggcatatc tccaattgtt tttgatagaa gtggaagctc 1140 
tgcatcagag tcatatgcag gttcagaaaa gaagcatgag aaattatcat cttccgttcg 1200 
tgctgtccga aaagatcaaa ccagtaaact caaatatgtg cttcaagatg caagattttt 1260 
cctcataaag agtaacaacc atgagaatgt gtctcttgcc aaagcgaagg gtgtatggtc 1320
cacgctccct gtaaatgaga agaaattaaa tcttgcattt agatctgcaa ggagtgttat 1380 
cttaatattt tctgtcagag agagtggaaa atttcaaggg tttgcaagac tttcttcaga 1440 
atcacatcac ggaggatctc ctatacactg ggtgcttcca gcaggaatga gtgctaaaat 1500 
gctgggaggt gtctttaaaa ttgactggat ttgcaggcgt gaattaccct tcactaagtc 1560 
ggctcatctc accaatcctt ggaatgaaca taaaccagta aagatcggac gtgatggaca 1620 
ggaaattgaa cttgaatgtg gaacccagct ttgtcttctg tttccccccg atgaaagtat 1680 
tgacttgtat caggtcattc ataaaatgcg tcacaagaga agaatgcatt ctcagccccg 1740 
atcacgagga cgtccatccc gtcgagaacc agtccgggat gtgggaaggc gtcgaccaga 1800 
agattatgat attcataaca gcagaaagaa accaaggatt gactatcccc ctgagtttca 1860 
ccagagacca gggtatttaa aggatccacg ataccaggaa gtggacagtt tcacaaatct 1920 
tattcccaac agacgatttt caggagttcg ccgagatgtg tttttaaatg ggtcctacaa 1980 
tgattatgtg agggaatttc ataacatggg accaccacca ccttggcaag gaatgccccc 2040 
ttacccagga atggaacaac ctccacacca tccttactat cagcaccatg ctccacctcc 2100 
tcaagctcat cccccttact caggacatca tccagtacca catgaagcaa gatacagaga 2160 
taaacgagta catgattatg atatgagggt ggatgatttc cttcgtcgca cacaagctgt 2220 
tgtcagtggc cggagaagta gaccccgtga aagagaccgg gaacgagagc gagaccgccc 2280 
tagagataac agacgagaca gagagcgaga tagaggacgt gatagagaaa gagaaagaga 2340 
gcgattatgt gatcgagaca gagaccgagg ggagagaggt cgatatagaa gataatgggc 2400 
ttttggaagc actgattgtt taaagataca aaaaatcttg tatttttttt ttgtgtgtgt 2460 
ttacaagtag taaatttatt ttcagctgtc tgcctatgaa gttcattgtg tagaaggatt 2520 
tattatgacc ccctttgttc caagcatgca gtatcataag aactggaaaa actcaaatcc 2580 
gccaaaaatc cacagctgac agttgaattg acacttttat tggggcaga 2629 

<210> 59 
<211> 3902 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1889415CB1 

<400> 59 

aggcgctgcg gcagcgcgga gcttgtaaac agatccggcc gcaggtgacc atgtgaacta 60 
cctgctcccg ggacgcttat tgtccttctc tcgatccgcg ccacaaagga gctcggcggt 120 
cggggcgcgg agggacaggc gaacgagccg ggagagccgg ccggcgcact aagatggctg 180 
aaggcgcccg gcgagggtga gcggggggcg cggcgcagcc agcggggagt cctcgggcgg 240 
gccgggccgg cggcccttcc ccgcggcgag aagccggagc caccatgtcg ttcgcgctgg 300 
aggagacgct cgagtcggac tgggtggctg tgcggcccca tgtgttcgac gagcgcgaga 360 
aacacaaatt cgtctttatt gtggcctgga acgagattga gggcaagttt gccataacct 420 
gccacaaccg gacggcccag aggcagagga gcggctcccg ggagcaagtg ggggcgcgag 480 
ggggcgccga ggccggcgga gctgcgtccg acgggagccg cgggcccggc agcccggcgg 540 
gcaggggtcg gcccgaggcc actgcctctg caactctggt taggagcccc gggccccggc 600 
ggagctcggc ctgggcggag ggcggctctc ctcggagcac tcgcagcctt ctgggggacc 660 
cgcggctgcg gagtcctggc agcaaagggg cggagagtcg tcttaggagc ccagtgcggg 720 
ccaaacccat cccgggtcag aaaacatctg aagccgacga tgcggcgggg gcagccgctg 780 
cagcagcccg gccggcgccc agagaggccc aggtgtcctc tgtacggata gtgagcgcct 840 
ctgggacggt ctccgaggag atagaggtgc tggaaatggt gaaggaggac gaggcacctc 900 
tggcgctctc ggacgcggag cagccgccgc ccgccaccga gctggagtct ccggccgaag 960 
agtgcagctg ggccggactg ttttctttcc aggacctgcg cgccgtgcac cagcagctgt 1020 
gctcggtgaa ctcgcagttg gagccgtgcc tgccggtgtt ccccgaggaa ccttcgggca 1080 
tgtggactgt gctgtttggg ggcgcccccg agatgaccga gcaggaaatc gacactctgt 1140 
gttaccagct ccaggtctac ctgggccacg gcctggacac ctgcggctgg aagatcctct 1200 
cccaggtgct cttcaccgag accgatgatc ccgaggagta ttacgaaagc ctcagcgagc 1260 
tgcggcagaa gggctacgaa gaagtgcttc agcgggccag gaagcgcatc caggagctct 1320 
tggataagca caagaataca gagagcatgg tggagcttct ggacttgtat cagttggagg 1380
atgaagccta cagcagcctt gcagaagcta caaccgaact ctatcagtat ttactacagc 1440
cattccgaga catgagagaa cttgccatgc tacgaagaca gcagatcaag atttccatgg 1500
agaatgatta tctgggacct cgaagaattg agagtctaca aaaagaagat gctgattggc 1560
agcggaaagc tcacatggct gtactgtcta ttcaagatct tactgtcaag tactttgaaa 1620
taacagctaa agctcaaaaa gctgtgtatg atcgaatgcg agctgatcag aagaaatttg 1680
gtaaagcatc atgggcagcg gctgctgaac ggatggaaaa actccagtat gcagtttcta 1740
aggaaacttt gcagatgatg agagctaaag agatatgctt ggaacagcgg aaacatgcac 1800
taaaggaaga gatgcagagt ttgcggggtg gtacagaagc gatagcacga ttggatcagt 1860
tagaagctga ttattatgat ctgcaacttc agttgtatga agtacagttt gaaatcttga 1920
agtgtgaaga gttactattg acagcgcaac tagaaagcat caaaagactt atatcagaaa 1980
aaagagatga agtggtatac tatgacactt acgaaagcat ggaggccatg ctggagaagg 2040
aagagatggc agcatctgcg tacttacaga gagaagagct gcagaaactt cagcagaaag 2100
cacgccagct ggaagcaaga cgtggacggg cttctgccaa gaaatcctac ctcagaaata 2160
aaaaggaaat atgtattgca aaacacaatg aaatccaaca gcgcactcgg attgaagatg 2220
aatatagaac ccatcacaca gtacaactaa gagaaaaatt acatgatgaa gaagaaagaa 2280
aaagtgcctg ggttagccaa gagagacaga gaacactgga tagacttcga acatttaaac 2340
agaggtatcc tgggcaagtc atacttaaat caaccagatt acgactagct catgcaagaa 2400
gaaaaggtgc agcaagtcct gttctccaag aggatcattg tgactcttta ccaagtgtgt 2460
tacaggtaga agagaaaact gaagaggtgg gagaaggaag agtcaagcgt gggccatcac 2520
agacaacaga accccagagc cttgtgcaac ttgaagatac ttcattaaca caacttgaag 2580
ccacctcatt acctctcagt ggtgttacct ctgaactgcc tcccactata tctcttccac 2640
ttttgaataa caacctcgaa ccatgttctg ttaccataaa tccactccca tcccctcttc 2700
ctccaacacc accacctccc ccacctcctc cccctccccc accaccacca cctctgcctg 2760
ttgctaagga cagtggccca gagacactgg agaaagatct gcctagaaag gaggggaatg 2820
acaagaggat accaaagtca gccagtgccc cctcagcaca cctctttgac agcagccagc 2880
tggtcagtgc acggaagaag ctcagaaaga ctgctgaagg tttgcagagg aggagagtga 2940
gctcacccat ggatgaggtg ctagcctcct tgaagcgtgg tagttttcat ctgaaaaagg 3000
ttgaacagcg aactctgcct ccttttcctg atgaagatga tagtaataat atcttggcac 3060
aaataaggaa aggggtaaaa ttgaagaagg tacagaagga tgttttgaga gaatccttca 3120
cacttctacc cgatacagac cctctaacac ggagcatcca tgaagctctt agaagaatta 3180
aagaagcatc cccagagtca gaggacgaag aggaggcttt accttgcaca gactgggaga 3240
actaacaagt gacataacag aagaaaaata tcattcaaga ttggttctga ttcttttgga 3300
aaatgacagt ttaagcactt ggttcctcag ttggataaca ttagatccaa agtgcattgg 3360
ttcagtggtg tgaagaaagg aagcacaatt ggcaggttat cactttccag tcgttccaat 3420
agatgatggt taacatgaat tttacatgtg caatgttcct gactgcaatg aagaaaaggc 3480
cttctggaga tttctgtttt ttaagcacat tctccttcca tccacatatt tacatccttg 3540
taatggccag ttaagctagg gcttgctgta agttggaaga acactgggtt gacagagatc 3600
tactgtgagc tgtattggga ctgctttgag aaccatcttt caatgcactg aaaagtcatc 3660
tgaaaaaata gcttcttcca tatcagctta tattaaatac tgtgacactc tgagaaagtt 3720
tatcatcatg atagctgtaa gtcaggcatt aaaatcaaat ggaaatacac ctaaaatgtc 3780
cgcattactc atttccaaag gatttttcat aaagatgttc agcaacatct gaaaagctgc 3840
atttatggag atacatctat taatatgttc acaacatgat acatttaaca tggcatattt 3900
tt 3902



<210> 60 

<211> 2993 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7506047CB1 



<400> 60 

cgctcgctcg ctgtcgcaca gactcaccgt cccttgtcca attatcatat tcatcgaccc 60 
gcaagatatc accgtgtgtg cactcgcgtg ttttcctctc tctgccgggg gaaaaaaaag 120
agagagagag agatagagag agagagagag agagagagag agaggctcgg tcccactgct 180
ccctacaccg cggtcccggg attcttgagc tgtgcccagc tgacgagctt ttgaagatgg 240
cacaataacc gtccagtgat gcctgaccat gacagcacag ccctcttaag ccggcaaacc 300
aagaggagaa gagttgacat tggagtgaaa aggacggtag ggacagcatc tgcatttttt 360
gctaaggcaa gagcaacgtt ttttagtgcc atgaatcccc aaggttctga gcaggatgtt 420
gagtattcag tggtgcagca tgcagatggg gaaaagtcaa atgtactccg caagctgttg 480
aaaaataaca tgaacaaaaa tggtggcacg gagcccagtt tccaagccag cggtctctct 540
agtacaggct ccgaagtaca tcaggaggat atatgcagca actcttcaag agacagcccc 600
ccagagtgtc tttccccttt tggcaggcct actatgagcc agtttgatat ggatcgctta 660
tgtgatgagc acctgagagc aaagcgcgcc cgggttgaga atataattcg gggtatgagc 720
cattccccca gtgtggcatt aaggggcaat gaaaatgaaa gagagatggc cccgcagtct 780
gtgagtcccc gagaaagtta cagagaaaac aaacgcaagc aaaagcttcc ccagcagcag 840
caacagagtt tccagcagct ggtttcagcc cgaaaagaac agaagcgaga ggagcgccga 900
cagctgaaac agcagctgga ggacatgcag aaacagctgc gccagctgca ggaaaagttc 960
taccaaatct atgacagcac tgattcggaa aatgatgaag atggtaacct gtctgaagac 1020
agcatgcgct cggagatcct ggatgccagg gcccaggact ctgtcggaag gtcagataat 1080
gagatgtgcg agctagaccc aggacagttt attgaccgag ctcgagccct gatcagagag 1140
caggaaatgg ctgaaaacaa gccgaagcga gaaggcaaca acaaagaaag agaccatggg 1200
ccaaactcct tacaaccgga aggcaaacat ttggctgaga ccttgaaaca ggaactgaac 1260
actgccatgt cgcaagttgt ggacactgtg gtcaaagtct tttcggccaa gccctcccgc 1320
caggttcctc aggtcttccc acctctccag atcccccagg ccagatttgc agtcaatggg 1380
gaaaaccaca atttccacac cgccaaccag cgcctgcagt gctttggcga cgtcatcatt 1440
ccgaaccccc tggacacctt tggcaatgtg cagatggcca gttccactga ccagacagaa 1500
gcactgcccc tggttgtccg caaaaactcc tctgaccagt ctgcctccgg ccctgccgct 1560
ggcggccacc accagcccct gcaccagtcg cctctctctg ccaccacggg cttcaccacg 1620
tccaccttcc gccacccctt cccccttccc ttgatggcct atccatttca gagcccatta 1680
ggtgctccct ccggctcctt ctctggaaaa gacagagcct ctcctgaatc cttagactta 1740
actagggata ccacgagtct gaggaccaag atgtcatctc accacctgag ccaccaccct 1800
tgttcaccag cacacccgcc cagcaccgcc gaagggctct ccttgtcgct cataaagtcc 1860
gagtgcggcg atcttcaaga tatgtctgaa atatcacctt attcgggaag tgcaatgcag 1920
gaaggattgt cacccaatca cttgaaaaaa gcaaagctca tgttttttta tacccgttat 1980
cccagctcca atatgctgaa gacctacttc tccgacgtaa agttcaacag atgcattacc 2040
tctcagctca tcaagtggtt tagcaatttc cgtgagtttt actacattca gatggagaag 2100
tacgcacgtc aagccatcaa cgatggggtc accagtactg aagagctgtc tataaccaga 2160
gactgtgagc tgtacagggc tctgaacatg cactacaata aagcaaatga ctttgaggtt 2220
ccagagagat tcctggaagt tgctcagatc acattacggg agtttttcaa tgccattatc 2280
gcaggcaaag atgttgatcc ttcctggaag aaggccatat acaaggtcat ctgcaagctg 2340
gatagtgaag tccctgagat tttcaaatcc ccgaactgcc tacaagagct gcttcatgag 2400
tagaaatttc aacaactctt tttgaatgta tgaagagtag cagtcccctt tggatgtcca 2460
agttatatgt gtctagattt tgatttcata tatatgtgta tgggaggcat ggatatgtta 2520
tgaaatcagc tggtaattcc tcctcatcac gtttctctca ttttcttttg ttttccattg 2580
caaggggatg gttgttttct ttctgccttt agtttgcttt tgcccaaggc ccttaacatt 2640
tggacactta aaatagggtt aattttcagg gaaaaagaat gttggcgtgt gtaaagtctc 2700
tattagcaat gaagggaatt tgttaacgat gcatccactt gattgatgac ttattgcaaa 2760
tggcggttgg ctgaggaaaa cccatgacac agcacaactc tacagacagt gatgtgtctc 2820
ttgtttctac tgctaagaag gtctgaaaat ttaatgaaac cacttcatac atttaagtat 2880
tttgtttggt ttgaactcaa tcagtagctt ttccttacat gtttaaaaat aattccaatg 2940
acagatgagc agctcactta tccaaagacc ccatttttcg agcgacatcg gtg 2993

<210> 61

<211> 679

<212> DNA

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 7505849CB1 

<400> 61 

ccttgacctc tggcccaaga tggtggcgcc cagagcttcg ctcttgctgc tcccctgagg 60 
tgaactgaag ccagcagccc cgcatcatgt caaagctcgg ccgggccgcc cggggcctca 120 
ggaagcccga gagaggcgtt tccatcaacc agttttgcaa ggagttcaat gagaggacaa 180 
aggacatcaa ggaaggcatt cctctgccta ccaagatttt agtgaagcct gacaggacat 240 
ttgaaattaa gattggacag cccactgttt cctacttcct gaaggcagca gctgggattg 300 
aaaagggggc ccggcaaaca gggaaagagg tggcaggcct ggtgaccttg aagcatgtgt 360 
atgagattgc ccgcatcaaa gctcaggatg aggcatttgc cctgcaggat gtacccctgt 420 
cgtctgttgt ccgctccatc atcgggtctg cccgttctct gggcattcgc gtggtgaagg 480 
acctcagttc agaagagctt gcagctttcc agaaggaacg agccatcttc ctggctgctc 540 
agaaggaggc agatttggct gcccaagaag aagctgccaa gaagtgaccc ttgccccacc 600 
aactcccaga tttcaaagga ggtagttgca aaagctgtgc ccaaggggag gaaggaggtc 660 
acaccaatat gatgatggt 679 

<210> 62 

<211> 918 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7505972CB1 

<400> 62 

gcgttaggag gctcgggtcg ttgtggtgcg ctgtcttccc gcttgcgtca gggacctgcc 60 
cgactcagtg gccgccatgg catcagatga aggcaaactt tttgttggag ggctgagttt 120 
tgacaccaat gagcagtcgc tggagcaggt cttctcaaag tacggacaga tctctgaagt 180 
ggtggttgtg aaagacaggg agacccagag atctcgggga tttgggtttg tcacctttga 240 
gaacattgac gacgctaagg atgccatgat ggccatgaat gggaagtctg tagatggacg 300 
gcagatccga ggacggggcc gtgggttctc tagaggagga ggggaccgag gctatggggg 360 
gaaccggttc gagtccagga gtgggggcta cggaggctcc agagactact atagcagccg 420 
gagtcagagt ggtggctaca gtgaccggag ctcgggcggg tcctacagag acagttacga 480 
cagttacgct acacacaacg agtaaaaacc cttcctgctc aagatcgtcc ttccaatggc 540 
tgtgtgttta aagattgtgg gagcttcgct gaacgttaat gtgtagtaaa tgcacctcct 600 
tgtattccca ctttcgtagt catttcggtt ctgatcttgt caaacccagc ctgaccgctt 660 
ctgacgccgg gatggcctcg ttactagact tttcttttta aggaagtgct gttttttttt 720 
gagggttttc aaaacatttt gaaaagcatt tacttttttt agccaggagc catgagtttt 780 
tccaaaaaat cggggggttg tgtgggggtt tttggggttt tggttttagt tttttgggtt 840 
gcgttggcct tttttttttt tagtgggggt tggtccccat gaagtggggt gcccccactc 900 
acttctctga aggacgga 918 

<210> 63 
<211> 2010 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7505991CB1 

<400> 63 

cggagacgcg aggaggggag cagagaacgc gctcgcgtag aggaattccg ggtcctcctc 60 
ctcgccctac cttggtttac ctgcagccgc ctagtcctct tctccttctc tttgatgatt 120 
cagggcctcg gcttccctca cgatattgca gaaagacaca gctttcctct tcctctccaa 180
accacctcga agcagggtta aggcataatt ggaatatcat tcggagaaag tgtcatggaa 240 
gttctgcgtc cacagcttat aagaattgat ggccggaatt acaggaagaa tccagtccaa 300 
gaacagacct atcaacatga agaagatgaa gaggacttct atcaaggctc catggagtgt 360 
gctgatgagc cctgtgatgc ctacgaggtg gagcagaccc cacaaggatt ccggtctact 420 
ttgagggccc ccagcttgct ctataagcat atagttggaa agagagggga cactaggaag 480 
aaaatagaaa tggagaccaa aacttctatt agcattccta aacctggaca agacggggaa 540 
attgtaatca ctggccagca tcgaaatggt gtaatttcag cccgaacacg gattgatgtt 600 
cttttggaca cttttcgaag aaagcagccc ttcactcact tccttgcctt tttcctcaat 660 
gaagttgagg ttcaggaagg attcctgaga ttccaggagg aagtactggc gaagtgctcc 720 
atggatcatg gggttgacag cagcattttc cagaatccta aaaagcttca tctaactatt 780 
gggatgttgg tgcttttgag tgaggaagag atccagcaga catgtgagat gctacagcag 840 
tgtaaagagg aattcattaa gctacaagaa ttagttgatc gagtgctgga acgttttcag 900 
gcatctggac taatagtgaa agagtggaat agtgtgaaac tgcatgctac agttatgaat 960 
acactattca ggaaagaccc caatgctgaa ggcaggtaca atctctacac agcggaaggc 1020 
aaatatatct tcaaggaaag agaatcattt gatggccgaa atattttaaa gttgtttgag 1080 
aacttctact ttggctccct aaagctgaat tcaattcaca tctctcagag gttcaccgta 1140 
gacagctttg gaaactacgc ttcctgtgga caaattgact tctcctgagg tggatcttgg 1200 
aaagcactag aaactaaaca tcttcaccag gtgctgaaga aaagtgtctt cgttttaatt 1260 
gccaagcagg gatgtggaca tttggatggt gactttcctg ggtggttccc catagattca 1320 
ccattgcctc taatggtgtc tacacccgtc atactaccag ctgagatggt ggtgggcata 1380 
aggagaattt gtgcctataa cccttagtgt gttctggttt tttttctttt aatttttaaa 1440 
ttgtcgtaaa atactcataa aacatactgt cttcaccatt tttaagtgca cagttcagta 1500 
acgttaactg ttaatacatt cataatgctg tgtggccgtc accgccgtcc atctccatag 1560 
gcttctcagc ttgtaaaatg gaaactgtac ccattaaaca gtaattccca ctcctcccag 1620 
cccccgcagc caccattctg ctttctgtct ctctggtttt gactattctc agtatctcat 1680 
ataagtggaa tcatacagtg acttgtcttt ttgtgactgg cttatttcac ttagcataat 1740 
gtcctcaagg ttcatccatg ttgtgtcagg tgacaggatt tccttcctgt attatacata 1800 
taatggaatg ttccgttaca tgtgcgtacc acactctgtt cctccatcag tgaacacttg 1860 
gcttgcttcc tcttgactat tgggagtagt gctaatacag acacgggggt gcaaatatct 1920 
ctttgagact ctgcctttaa ttcctttacc acccagagtt ttatctttag caaaataacc 1980 
attaaagttg tttgcctttt aaaaaaaaaa 2010 

<210> 64 

<211> 4677 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7506003CB1 

<400> 64 

gagaggatgc cgcagatgtg ttagcggcga gtccagaagc agccccagga ggtgctgggg 60 
gcatcgtttc tctaatctgg cctcccgagt gccaaggagg cgtcccggca gcggtcatca 120 
tggtgaagga gcagttccgg gagacggatg tggccaagaa aataagccac atctgttttg 180 
gaatgaagtc acctgaggag atgcgccagc aggcgcacat ccaagttgtg agtaagaacc 240 
tgtacagcca ggacaaccaa catgccccct tgctatatgg ggtgctcgac cataggatgg 300 
gtacgagtga gaaggatcgt ccatgtgaaa cctgtgggaa aaacttggct gactgtctag 360 
gccactatgg gtatatcgac ctggagttgc cgtgttttca tgtagggtac ttcagagcag 420 
tcataggcat cttacagatg atctgcaaaa cctgctgcca catcatgctg tcccaagagg 480 
agaagaagca gtttctggac tatctaaaga ggcccggcct gacctacctt cagaagcgag 540 
gactgaaaaa gaaaatctct gacaagtgcc ggaagaaaaa catctgccat cactgtggcg 600 
cttttaatgg taccgtaaag aagtgtggac tgctgaaaat aattcatgag aaatacaaga 660 
ccaacaaaaa agtggtggat cccattgtat caaatttcct tcagtctttt gaaacagcca 720 
ttgaacataa taaagaagtg gagcctctgc tgggaagggc acaggaaaac ttgaatccct 780 
tagtagttct gaatttattt aaacgaatcc cagctgaaga tgttcctcta cttctgatga 840
acccagaagc cggaaagccg tctgatttga ttctcacacg acttttggtg cctcctttgt 900
gtttcagacc ctccgttgtg agtgatttga agtctggcac caatgaagat gatctgacaa 960
tgaaactgac agaaatcatt ttcctaaacg atgttattaa aaagcatcgg atctcaggag 1020
ccaagaccca gatgatcatg gaggactggg atttcctgca gctgcagtgt gccctctaca 1080
ttaacagtga gctctcgggc attcccctca acatggcacc caagaagtgg accagaggct 1140
tcgtccaacg cctgaaggga aaacagggtc gatttagagg aaatctctca ggaaagagag 1200
tggatttttc tggcagaaca gtcatctcgc ccgaccccaa cctccggatt gatgaggtag 1260
ctgtgccagt tcatgtggcc aaaattctaa cttttcctga gaaggtaaac aaagcaaaca 1320
tcaatttctt gaggaaactg gttcaaaacg gccctgaggt tcacccagga gcaaacttca 1380
ttcagcagag acatacgcag atgaaaaggt ttttgaaata cggaaatcga gaaaagatgg 1440
ctcaagagct caagtatggt gacatcgtag agagacacct catcgatgga gatgtggtgc 1500
tgttcaatcg gcagccctcg ctgcacaaat tgagcattat ggctcatctg gccagggtca 1560
agccccaccg gaccttcaga tttaatgagt gtgtctgtac accctataat gctgactttg 1620
atggtgatga aatgaacctt catcttcctc aaacagaaga agctaaagca gaggcccttg 1680
ttctgatggg gactaaagca aatcttgtaa ccccgaggaa tggggaaccg ctgattgctg 1740
ctattcagga ttttctaaca ggtgcctatc tcctcactct caaggacact ttctttgatc 1800
gagccaaggc ttgccaaatc attgcttcaa tactggttgg caaggatgag aaaattaaag 1860
ttcgcctccc accgcctaca atcctaaagc ctgtcaccct gtggacggga aagcagatct 1920
tcagtgtcat cctcaggcct agcgatgaca atccagtgag ggccaacctg cgaaccaagg 1980
gcaagcagta ctgtggcaaa ggggaagatc tctgtgccaa tgattcctat gttacaatcc 2040
agaacagtga gttgatgagt ggcagcatgg acaaaggaac cctagggtca ggatccaaga 2100
acaatatttt ttacattttg ctgcgagact ggggacagaa tgaagctgca gatgccatgt 2160
cacggctcgc caggctggct cctgtctacc tgtctaaccg tggtttctca attgggatcg 2220
gtgatgtcac acctggccaa ggactgctga aggccaagta tgagttgctg aatgccggct 2280
acaagaaatg tgatgagtac atcgaagccc tgaacacggg caagctgcag cagcagcctg 2340
gctgcactgc tgaggagacc ctggaggcac tgatcctgaa ggagctgtct gtgatccgtg 2400
accacgctgg cagtgcctgc ctccgggagc tggacaagag caacagcccc ctcaccatgg 2460
ctctgtgcgg ctccaaaggt tccttcatta acatatcaca gatgattgcc tgtgtgggac 2520
agcaggccat cagtggctct cgagtgccag acggctttga aaacaggtcc ttgcctcatt 2580
ttgaaaaaca ctcaaagctc ccagctgcca aaggctttgt ggctaatagc ttttattccg 2640
gtttgacacc aactgagttt ttcttccaca caatggccgg ccgggaaggt ctagtcgaca 2700
cggctgtaaa gacagctgaa acgggataca tgcagcgaag gcttgtcaaa tctcttgaag 2760
atctttgctc ccagtatgat ctgacagtcc gaagctctac tggcgatatt atccagttca 2820
tttatggagg agatggctta gatcctgcag ctatggaggg aaaagatgaa cctttggagt 2880
ttaaaagggt tctggacaac atcaaagcag tcttcccgtg tcccagtgag cctgctctca 2940
gcaaaaacga gctgatcctg accacagagt ccatcatgaa gaagagtgag ttcctctgct 3000
gccaggacag cttcctgcag gaaataaaaa aattcattaa gggggtctct gagaagatca 3060
agaaaaccag agataaatat ggcatcaatg ataacggcac aacagagccc cgtgtgctgt 3120
accagctgga ccgcatcacc cccacccaag tagaaaagtt tctggagacc tgtagggaca 3180
agtacatgag ggcacagatg gagccaggtt ctgcagtggg tgctctgtgt gcccagagca 3240
ttggtgagcc aggcacccag atgaccctga agactttcca ctttgcaggt gtggcctcca 3300
tgaacatcac cctgggcgtg ccccggatta aagagatcat caacgcttcc aaggccatca 3360
gcactccaat tatcacagca cagctagaca aggatgacga cgcggattat gctcgcctcg 3420
tgaaagggag aattgagaaa accctcttgg gagaggtgaa cgctgagaca gtgagatatt 3480
ccatctgcac atccaagctc cgtgtgaagc ccggtgatgt ggctgttcat ggtgaggctg 3540
tggtgtgtgt cacccccaga gagaacagca agagctccat gtactacgtg ctgcagttcc 3600
tgaaagagga tctccccaag gtggtggtgc agggcattcc agaggtgtcc agagctgtca 3660
tccacattga cgagcagagt ggaaaggaga agtacaagct tctggtggaa ggtgataacc 3720
tgcgggcagt catggccaca cacggtgtga agggcacccg aaccacctcc aataacacct 3780
atgaggtgga gaaaactctg ggcatcgagg ccgcccggac aacgatcatc aatgaaatcc 3840
agtacaccat ggtgaaccac ggcatgagca tcgacaggag gcacgtgatg ctgctctccg 3900
acctcatgac ctacaagggt gaagtcctgg gcatcactag gtttggcctg gccaagatga 3960
aggagagtgt gctgatgctg gcctcctttg agaagacggc tgaccatctc tttgacgctg 4020
cctacttcgg gcagaaggac tctgtgtgtg gggtgtctga gtgcatcatc atgggaatcc 4080
caatgaacat tggaaccggg ctcttcaagc tgcttcacaa ggctgacagg gacccgaacc 4140
ctcccaagag gcccctgatc ttcgacacaa atgaattcca catccccctt gtcacatagt 4200
ccaaagaaag aggggaccat gcctgacctt gactccttgt cctgtctcca gctgatgtat 4260
aaagagtttt gtgctccctg ggacgggggt cctgaggtcc ccacctatgc cagcaatcag 4320
agaagccctc ttggcatccc caggagcagc ttctcctctg atagggtgca gctcacacca 4380
gtgaccctga ctgtgccacg ctgctcggga gagctgaggg ttttattgtt tgcttgcttg 4440
aaacctaatc tatagacggc cccacagctc gtgcacacac tgcttccctg gacttaaagc 4500
ccagccaggt gtcatggtcc atcccagctc actgcataca tccatcggct ccccatggtg 4560
tcttcacacc tgacgatgag ccaggcctga gccccacaca ggccagggca cattcttgga 4620
ttttccattc cttggtcatg ctggaatctc tcaatgtgac atacttatgt aaatatc 4677



<210> 65 

<211> 1110 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 6483977CB1 



<400> 65 

cggctcgagc cgggactctt ggcgggtgaa ggtgtgtgtc agcttttgcg tcactcgagc 60
cctgggcgct gcttgctaaa gagccgagca cgcgggtctg tcatcatgtc gcgttacggg 120
cggtacggag gagaaaccaa ggtgtatgtt ggtaacctgg gaactggcgc tggcaaagga 180
gagttagaaa gggctttcag ttattatggt cctttaagaa ctgtatggat tgcgagaaat 240
cctccaggat ttgcctttgt ggaattcgaa gatcctagag atgcagaaga tgcagtacga 300
ggactggatg gaaaggtgat ttgtggctcc cgagtgaggg ttgaactatc gacaggcatg 360
cctcggagat cacgttttga tagaccacct gcccgacgtc cctttgatcc aaatgataga 420
tgctatgagt gtggcgaaaa gggacattat gcttatgatt gtcatcgtta cagccggcga 480
agaagaagca ggtcacggtc tagatcacat tctcgatcca gaggaaggcg atactctcgc 540
tcacgcagca ggagcagggg acgaaggtca aggtcagcat ctcctcgacg atcaagatct 600
atctctcttc gtagatcaag atcagcttca ctcagaagat ctaggtctgg ttctataaaa 660
ggatcgaggt atttccaatc cccgtcgagg tcaagatcaa gatccaggtc tatttcacga 720
ccaagaagca gtcgttcccc atcaggaagt cctcgcagaa gtgcaagtcc tgaaagaatg 780
gactgaagct ctcaagttca ccctttaggg aaaagttatt ttgtttacat tattataagg 840
gatttgtgat gtctgtaaag tgtaacctag gaaagataat tcaaccatct aatcaaaatg 900
gatctggatt actatgtaaa ttcacagcag taagataata taaattttgt tgaatgtatt 960
aacatcatat ggtctgaaaa tgtgggtttt tatttggcac atttaaataa aatgtttcta 1020
actagatttt tgatttgtgt tcaatattaa cacttcttaa tttgatatat ttgagagtca 1080
gacattataa ttgttaacct tattcataca 1110



<210> 66 

<211> 687 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 6301777CB1 

<220> 

<221> unsure 

<222> (1)...(687)

<223> a, t, c, g, or other 



<400> 66 

tttttgtgtt gtgttttctc ttttcttttg ttgttctctg gtttgcctgc cttcagtctc 60 
ttttccgaaa agggatgtgc tgcaaggcga tttacttggt aacgccaggg ttttcccagt 120
cacgacgttg taaaacgacg gccagtgaat tgaatttagg tgacactatt agaagannnn 180 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnngccc gggccgcgcc 240 
cgggacctca ttccggacct cggtttccct ttgcgtgggg ctcgcagctg ccaccgcttc 300 
cccgcggagc atggggctgc ctcgccgggc aggggacgcg gcggagctgc gcaagagcct 360 
gaagccgctg ctggagaagc gccggcgcgc gcgcatcaac cagagcctga gccagcttaa 420 
ggggctcatc ctgccgctgc tgggccggga ggatgcttct ggctggcaca cctggcttcc 480 
cctccatgct cagaactgcc tcctactcta catccaggct cctgagcagc ccccagctta 540 
agcttcagca gctccagggt aaatccatct cggcccagag ctgactccgg atctgcccca 600 
gtgacaataa caagataaat gccttccatt ctcttcgtac ttacaggatg taatccacat 660 
atttacacag tgtttcattt gcggtaa 687 

<210> 67 

<211> 599 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 7505976CB1 

<400> 67 

gactttcgct tccggctgcc gcaggcgctt cgctggtgca gacgcagtgc tgagcacaca 60 

gctaccggac aaagagtgac gcccggagct ggagttatgg cggctacgga gccgatcttg 120 

gcggccactg ggagtcccgc ggcggtgcca ccggagaaac tggaaggagc cggttcgagc 180 

tcagcccctg agcgtaactg tgtgggctcc tcgctgccag aggcctcacc gcctgcccct 240 

gagccttcca gtcccaacgc cgcggtccct gaagccatcc ctacgccccg agctgcggcc 300 

tccgcggccc tggagctgcc tctcgggccc gcacccgtga gcgtagcgcc tcaggccgaa 360 

gctgaagcgc gctccacacc aggccccgcc ggctctagac tcggtcccga gacgttccgc 420 

cagcgtttcc ggcagttccg ccgccgcacg gatgtgcgca tcactggctg agcggtggag 480 

ctgcgggcgg ccagggccgg gcgctctgtg cggactgggg ccatgatcgg gcccgggggc 540 

ctgagcctgg gaccccaccc cgtgttaatg aaaaatgagt tttggcagcg aaaaaaaaa 599 

<210> 68 
<211> 2095 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7506016CB1 

<400> 68 

cgccgcccgg ccggtgctcc ctggcgcagc cacgcaggcg caccgcagac agacccctct 60 
gccatgaacc agtccatccc agtggctccc accccacccc gccgcgtgcg gctgaagccc 120 
tggctggtgg cccaggtgaa cagctgccag tacccagggc ttcaatgggt caacggggaa 180 
aagaaattat tctgcatccc ctggaggcat gccacaaggc atggtcccag ccaggacgga 240 
gataacacca tcttcaaggc ctgggccaag gagacaggga aatacaccga aggcgtggat 300 
gaagccgatc cggccaagtg gaaggccaac ctgcgctgtg cccttaacaa gagccgggac 360 
ttccgcctca tctacgacgg gccccgggac atgccacctc agccctacaa gatctacgag 420 
gtctgctcca atggccctgc tcccacagac tcccagcccc ctgaggatta ctcttttggt 480 
gcaggagagg aggaggaaga agaggaagag ctgcagagga tgttgccaag cctgagcctc 540 
acagaggatg tcaagtggcc gcccactctg cagccgccca ctctgcagcc gcccgtggtg 600 
ctgggtcccc ctgctccaga ccccagcccc ctggctcctc cccctggcaa ccctgctggc 660 
ttcagggagc ttctctctga ggtcctggag cctgggcccc tgcctgccag cctgccccct 720 
gcaggcgaac agctcctgcc agacctgctg atcagccccc acatgctgcc tctgaccgac 780 
ctggagatca agtttcagta ccgggggcgg ccaccccggg ccctcaccat cagcaacccc 840
catggctgcc ggctcttcta cagccagctg gaggccaccc aggagcaggt ggaactcttc 900 
ggccccataa gcctggagca agtgcgcttc cccagccctg aggacatccc cagtgacaag 960 
cagcgcttct acacgaacca gctgctggat gtcctggacc gcgggctcat cctccagcta 1020 
cagggccagg acctttatgc catccgcctg tgtcagtgca aggtgttctg gagcgggcct 1080 
tgtgcctcag cccatgactc atgccccaac cccatccagc gggaggtcaa gaccaagctt 1140 
ttcagcctgg agcattttct caatgagctc atcctgttcc aaaagggcca gaccaacacc 1200 
ccaccaccct tcgagatctt cttctgcttt ggggaagaat ggcctgaccg caaaccccga 1260 
gagaagaagc tcattactgt acaggtggtg cctgtagcag ctcgactgct gctggagatg 1320 
ttctcagggg agctatcttg gtcagctgat agtatccggc tacagatctc aaacccagac 1380 
ctcaaagacc gcatggtgga gcaattcaag gagctccatc acatctggca gtcccagcag 1440 
cggttgcagc ctgtggccca ggcccctcct ggagcaggcc ttggtgttgg ccaggggccc 1500 
tggcctatgc acccagctgg catgcaataa caaggctgca gacggtgact ggccctggct 1560 
tcctgggtgg cggtgcggac tgatgtggag atgtgacagc cccgatgagc acctggctgg 1620 
ctgcagggtc ctacctctgg gtttcctgga agtggatttg ggccaagaag gagagggaga 1680 
aaggcccgag cccctgcctt cccgggcctt tctctcctgg gctgtctctg gtctggtcag 1740 
cctggctctc gggaaattca gccatgagca gggaaagaac tctcccaacc ctggggccta 1800 
gctgtatagg aggaattgcc taagggtggc ccactcttgt gattgcccca tttcctctgg 1860 
caacaaaagc cagagtgttg tgggccaagt ccccccacag ggcctctgca gggcatggcc 1920 
ctgatttccc tggtttgaga ctcacttcct catctccctg tcctctgaga taatatgagt 1980 
gagcacttag gtatcatatc agatgctcaa ggctggcagc tacccccttc ttgagagtcc 2040 
aagaacctgg agcagaaata atttttatgt attttggatt aataaatgtt aaaaa 2095 

<210> 69 
<211> 2843 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7506086CB1 

<400> 69 

actagttctc tctctctctc taggcagcaa ggagaagcag gctgacttcc tggacaagca 60 
gatgagccgg gatgagcacc gagcccgagc catgaagatc cctttcacca atgacaaaat 120 
catcaacctg cctgtggagg agttcaatga actgctgtcc aaataccagt tgagtgaagc 180 
ccagctgagc ctcatccgag agtgggcatg gcgacttggt ctcagccgga ctcgggtttc 240 
tctggaaacc cccctggtaa gtgtggagga ggcgggacac tctgacccaa gacgaaaggc 300 
ctgtagctcc agccaaagaa aataaacctt aggagggaga aggaaaaaaa aaatccatca 360 
gctgttcctg agaacagcct gcattggaat ctacagagag gacaactaat gtgagtgagg 420 
aagtgactgt atgtggactg tggagaaagt aagtcacgtg ggcccttgag gacctggact 480 
gggttaggaa cagttgtact ttcagaggtg aggtgtcgag aagggaaagt gaatgtggtc 540 
tggagtgtgt ccttggcctt ggctccacag ggtgtgcttt cctctggggc cgtcagggag 600 
ctcatccctt gtgttctgcc agggtggggt acggggtttg acactgagga gggtaacctg 660 
ctggctggag cggcagagca gtggccttga tttgtctttt ggaagatttt aaaaaccaaa 720 
aagcataaac attctggtcc ttcagcaatg ctttctctga agaaatactt aacggaagga 780 
cttctccagt tcaccattct gctgagtttg attggggtac gggtggacgt ggatacttac 840 
ctgacctcac agcttccccc actccgggag atcatcctgg ggcccagttc tgcctatact 900 
cagacccagt tccacaacct gaggaatacc ttggatggct atggtatcca ccccaagagc 960 
atagacctgg acaattactt cactgcccgg cggctcctca gtcaggtgag ggccctggac 1020 
aggttccagg tgccaaccac tgaggtaaat gcctggctgg ttcaccgaga cccagagggg 1080 
tctgtctctg gcagtcagcc caactcaggc ctcgccctcg agagttccag tggcctccaa 1140 
gatgtgacag gcccagacaa cggggtgcga gaaagcgaaa cggagcaggg attcggtgaa 1200 
gatttggagg atttgggggc tgtagccccc ccagtcagtg gagacttaac caaagaggac 1260 
atagatctga ttgacatcct ttggcgacag gatattgatc tgggggctgg gcgtgaggtt 1320 
tttgactata gtcaccgcca gaaggagcag gatgtggaga aggagctgcg agatggaggc 1380 
gagcaggaca cctgggcagg cgagggcgcg gaagctctgg cacggaacct gctagtggat 1440
ggagagactg gggagagctt ccctgcacag gtgcctagtg gggaggacca gacggccctg 1500
tccctggaag agtgccttag gctgctggaa gccacctgcc cctttgggga gaatgctgag 1560
tttccagcag acatttccag cataacagaa gcagtgccta gtgagagtga gccccctgct 1620
cttcaaaaca acctcttgtc tcctcttctg accgggacag agtcaccatt tgatttggaa 1680
cagcagtggc aagatctcat gtccatcatg gaaatgcagg aaggtgcggt tggctacagc 1740
tctgactctg agaccctgga tctggaagag gccgagggtg ctgtgggcta ccagcctgag 1800
tattccaagt tctgccgcat gagctaccag gatccagctc agctctcatg cctgccctac 1860
ctggagcacg tgggccacaa ccacacatac aacatggcac ccagtgccct ggactcagcc 1920
gacctgccac cacccagtgc cctcaagaaa ggcagcaagg agaagcaggc tgacttcctg 1980
gacaagcaga tgagccggga tgagcaccga gcccgagcca tgaagatccc tttcaccaat 2040
gacaaaatca tcaacctgcc tgtggaggag ttcaatgaac tgctgtccaa ataccagttg 2100
agtgaagccc agctgagcct catccgagac atccggcgcc ggggcaagaa caagatggcg 2160
gcgcagaact gccgcaagcg caagctggac accatcctga atctggagcg tgatgtggag 2220
gacctgcagc gtgacaaagc ccggctgctg cgggagaaag tggagttcct gcgctccctg 2280
cgacagatga agcagaaggt ccagagcctg taccaggagg tgtttgggcg gctgcgagat 2340
gagaacggac gaccctactc gcccagtcag tatgcgctcc agtacgccgg ggacggcagt 2400
gtcctcctca tcccccgcac gatggccgac cagcaggccc ggcggcagga gaggaagcca 2460
aaggaccgga gaaagtgagc ctggggaaga agggggtttg aagcccacca agaccgaaac 2520
tggagaaggg ctgggacctg gacctggacc tggacctaca gcggggactt aaatgccttc 2580
ttatccaata tatcttctca gatgggatga ctgcgggtca gtgtacagga agaggcaggc 2640
actggctggc tcagctccac tcgggtggag tggaagtggc cagaccattt agacggacag 2700
ggtcctcacc ctaccccttt cctgtgaggc aggggtggtg gtggagttgc tggaggtaga 2760
ggagctatgt ggagcaaagg ccgacagagg ggaaggaatg gacctgtgag aggaagggaa 2820
ggtggcagaa agtctcattt cag 2843



<210> 70 
<211> 2482 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 4176657CB1 



<400> 70 

gtgacgcttt ccctgggctc tgaaggccag ggggcttgga tgtgagtcta gtgctcctgg 60

tccccaaagt cacaacccct tctatcccag catcccagac ttctccgggc ggggacagag 120

ttggacgtgc atggctgtgt cctgcaatcc cctcgccacg accgcccggg acggagtgga 180

agcgggtccg tgcccccgtt cctcgcgcca gcctcggcgg tggcaagcgc tgtcccccag 240

cagaggtgtc cctggataca ggatagcaaa acctgatctc ataaaaccta ggtcacaaag 300

gacagccctg caaaacagac cctatttgga tcaagtgagc cagttcctgg aacctgaata 360

atgactcctg aatcaaggga tactacagat ttgtctccag ggggtaccca ggagatggaa 420

ggcatcgtga tagtgaaggt ggaggaggaa gatgaagaag accattttca aaaggaaaga 480

aacaaagtag agtcatcgcc acaagttctc agtcgctcta caactatgaa tgagagagcc 540

ttattgtcat cgtatttagt tgcatataga gtggcaaaag agaaaatggc tcacacagcg 600

gctgaaaaaa ttatccttcc agcatgtatg gacatggtac ggacaatttt tgatgacaaa 660

tcagctgata aactaagaac tatacctctt agtgataata caatatctcg tcgaatctgt 720

acgattgcaa aacatttgga agcaatgctt attacacggc tgcagtccgg tatagacttt 780

gcaatccaac tcgatgagag cactgatatt gcaagttgtc ccacactctt ggtttatgtc 840

agatatgtgt ggcaagatga ttttgtagag gatctcttat gttgtttaaa tttaaattca 900

catataactg gattagattt atttactgaa ttagaaaact gccttcttgg tcagtataaa 960

ttaaactgga aacattgtaa aggaatttca agtgatggaa cagcaaatat gaccggaaaa 1020

cacagcagac ttactgaaaa attgttagaa gcaacccaca acaatgctgt ttggaatcac 1080

tgttttattc atcgagaagc tttggtatcc aaagaaattt caccaagtct gatggatgta 1140

ttgaaaaatg cagtgaaaac tgttaatttt attaaaggaa gctcactgaa tagccgactt 1200

ctcgaaatat tttgttcaga gattggagtg aaccacaccc acttattgtt tcatacagaa 1260

gttcgttggc tttctcaagg aaaagtattg agcagagtat atgaactcag gaacgagatt 1320 

tacatttttc tcgttgaaaa gcaatctcat ttggcaaata tttttgaaga cgacatttgg 1380 

gtaacaaaat tggcatattt aagtgatatt tttggcattc ttaatgaatt aagcctgaaa 1440 

atgcagggga aaaacaatga tatatttcag tatcttgaac atattctagg attccaaaag 1500 

acgttattat tgtggcaagc aagacttaaa agtaaccgcc ctagctacta tatgtttcca 1560 

acattattgc aacacatcga agagaacatt attaatgaag actgcttaaa agaaataaaa 1620 

ttagagatat tgttgcatct cacttctttg tctcaaactt ttaattatta ctttccggaa 1680 

gagaaatttg aatcattaaa ggaaaatatt tggatgaaag atccatttgc ttttcaaaac 1740 

ccagaatcaa taattgagtt aaacttggag cctgaagaag agaatgaatt attgcagctc 1800 

agttcatcat tcacactaaa gaattattat aagatattaa gtttatcagc attttggatt 1860 

aagattaaag atgactttcc actgctaagt aggaagagta tattgctgtt actaccattc 1920 

acaactacat atttgtgtga actaggattt tcaatcttga cacggttaaa aacaaagaag 1980 

agaaataggc tcaatagtgc accagatatg cgggtagcat tatcttcatg tgttcctgac 2040 

tggaaggaac ttatgaacag acaagcacac ccatcacatt aaatacaaac tttacaaaat 2100 

tctgtgtata gccaggtgtg gtggcttacg cctgtaatcc cagcagtggg agaccgaggt 2160 

gggcagatca cttgagttca agaccagcct ggccaacatg gtgaaacccc atctctacta 2220 

aaaatagaaa ccttagccag gcgtggtggc acatgcctgc agtcccagtt acttgggtgc 2280 

ctgaggcagg agaatctctt aaaccaggaa ggcagagatt gcagtgagct gagataatcc 2340 

cactgcattc cagcctgggc aacagcgtga gacttcatct caaaaaaaaa aaattgtatt 2400 

tgtactttta aagggatttt gcagtatgtt gtagttaaac gttaataaaa ttatatttgt 2460 

aattaggaaa aaaaaaaaaa aa 2482 

<210> 71 
<211> 1730 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7506056CB1 

<400> 71 

gtccccttgc ttcgtcttcg cttttctttc tacttattct tatctgtgtc tttcgctttg 60 
tttgcctctc cgtctgtttt ccctcagggc ccccttcttt cctcgacctt ttcaaatcgc 120 
aaatatggcg ccggagcggc tgcggagccg ggcgctctcc gccttcaagt tgcggggctt 180 
gctgctccgt ggtgaagcta ttaagtacct cacagaagct cttcagtcta tcagtgaatt 240 
agagcttgaa gataaactgg aaaagataat taatgcagtt gagaagcaac ccttgtcatc 300 
aaacatgatt gaacgatctg tggtggaagc agcagtccag gaatgcagtc agtctgttga 360 
tgaaactatt ctgttaatga ccaaccaccc tgcaccaaat ttatttggaa caccaagaga 420 
taaagcagag atgtttcgtg agcgatatac cattttgcac cagaggaccc acaggcatga 480 
attatttact cctccggtga taggttctca ccctgatgaa agcggaagca aattccagct 540 
taaaacaata gaaaccttat tgggtagtac aaccaaaatc ggagatgcga ttgttcttgg 600 
aatgataacg cagttaaaag agggaaaatt ttttctggaa gatcctactg gaacagtcca 660 
actagacctt agtaaagctc agttccatag tggtttatac acagaggcat gctttgtctt 720 
agcagaaggt tggtttgaag atcaagtgtt tcatgtcaat gcctttggat ttccacccac 780 
tgagccctct agtactacta gggcatacta tggaaatatt aatttttttg gaggtccttc 840 
taatacatct gtgaagactt ctgcaaaact aaaacagcta gaagaggaga ataaagatgc 900 
tatgtttgtg tttttatctg atgtttggtt ggaccaggtg gaagtattgg aaaaacttcg 960 
cataatgttt gctggttatt caccagcacc tccaacctgc tttattctgt gtggtaattt 1020 
ttcatctgca ccatatggaa aaaatcaagt tcaagctttg aaagattccc taaaaacttt 1080 
ggcagatata atatgtgaat acccagatat tcaccaaagt agtcgttttg tgtttgtacc 1140 
tggtccagag gatcctggat ttggttccat cttaccaagg ccaccacttg ctgaaagcat 1200 
cactaatgaa ttcagacaaa gggtaccatt ttcagttttt actactaatc cttgcagaat 1260 
tcagtactgt acacaggaaa ttactgtctt ccgtgaagac ttagtaaata aaatgtgcag 1320 
aaactgcgtc cgttttccta gcagcaattt ggctattcct aatcactttg taaagactat 1380 
cttatcccaa ggacatctga ctcccctacc tctttatgtc tgcccagtgt attgggcata 1440
tgactatgct ttgagagtgt atcctgtgcc cgatctactt gtcattgcag acaaatatga 1500
tcctttcact acgacaaata ccgaatgcct ctgcataaac cctggctctt ttccaagaag 1560
tggattttca ttcaaagttt tttatccttc taataagaca gtagaagata gcaaacttca 1620
aggcttttga gattcttaaa gatcatctga agaaaattca tcagttttct gcttaactct 1680
atatcttatg tgattctgat attacaataa aattatggta aactttaaaa 1730



<210> 72 
<211> 3119 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506185CB1 



<400> 72 

cccggggcct ggctttcagc gtagcaattc tgccggcgaa gaaggtgagc gcagtgctgt 60
gtggcagcag agctccttag gacgaggagc agcgggacga ggaagggcag actggtgaaa 120
tcgcaaactg ggcgtctgtt ccggcgccgg acccctattt gcaaaggtcc atggctaatg 180
aaacacagaa ggttggtgcc atccattttc cttttccctt cacaccctat tccatccagg 240
aagacttcat ggcagagctg taccgggttt tggaggctgg caagattggg atatttgaga 300
gtccaactgg cactgggaag tccttaagtc ttatttgtgg ggccctctct tggctccgtg 360
actttgaaca gaagaagcgt gaagaagagg cacgactcct tgaaactgga actggcccct 420
tacatgatga gaaagatgaa tccctgtgtc tgtcttcttc ctgcgaaggg gctgcaggca 480
ccccgaggcc tgctggagaa ccggcctggg ttactcagtt tgtgcagaag aaagaagaga 540
gggacctggt ggaccgacta aaggcggagc aggccaggag gaagcagcga gaagaacgcc 600
tgcagcagct gcagcacagg gtgcagctca agtatgcagc caagcgcctg aggcaggaag 660
aagaagaaag agagaatctc ctccgcctca gcagggagat gctagagaca ggcccggagg 720
ctgagcggct ggagcagctg gagtctgggg aggaggagct ggtcctcgcc gaatacgaga 780
gtgatgagga gaaaaaggtg gcgagcagag tggatgagga tgaggatgac ctggaggaag 840
aacacataac taagatttat tactgtagtc ggacacactc ccagctggcc cagtttgtgc 900
atgaggtgaa gaagagcccc tttggcaagg atgttcggct ggtctccctt ggctcccggc 960
agaacctttg tgtaaatgaa gacgtgaaaa gcctaggttc tgtgcagctt atcaacgacc 1020
gctgtgtgga catgcagaga agcaggcacg agaagaagaa aggagctgag gaggagaagc 1080
caaagaggag gaggcaggag aagcaggcag cctgcccctt ctacaaccac gagcagatgg 1140
gccttctccg ggatgaggcc ctggcagagg tgaaggacat ggagcagctg ctggcccttg 1200
ggaaggaggc ccgggcctgt ccctattacg ggagccgcct tgccatccct gcagcccagc 1260
tggtggtgct gccctatcag atgctgctgc atgcggccac tcggcaggcc gcgggcatcc 1320
ggctgcagga ccaggtggtg atcatcgacg aggcgcacaa cctgatcgac accatcacgg 1380
gcatgcacag cgtggaggtc agcggctccc agctctgcca ggcccattcc cagctgctgc 1440
agtacgtgga gcgatacggg aagcgtttga aggccaagaa cctgatgtac ctgaagcaga 1500
tcctgtattt gctggagaaa ttcgtggctg tgctaggggg gaacattaag caaaatccca 1560
atacacagag tctgtcacag acagggacgg agctgaagac catcaacgac tttctcttcc 1620
agagccagat cgacaacatc aacctgttca aggtgcagcg atactgtgag aagagcatga 1680
tcagcagaaa gctctttgga ttcactgaac ggtacggagc agtgttctca tcccgggagc 1740
agcccaaact ggctgggttt cagcaattcc tgcagagcct gcagcccagg acgactgaag 1800
ctcttgcagc ccctgcagac gagagtcagg ccagcaccct gcgaccagct tctccactga 1860
tgcacatcca aggcttcctg gcagctctca ctacggccaa ccaggacggc agggtcatcc 1920
tgagccgcca aggcagcctc agtcagagca ccctgaagtt tttgctcctg aatccagctg 1980
tgcactttgc ccaagtggtg aaggaatgcc gggcagtggt cattgcgggg ggtaccatgc 2040
agccggtgtc tgacttccgg cagcagctgc tggcctgtgc cggggtggaa gctgagcgcg 2100
tggtggagtt ttcctgtggt cacgtgatcc ctccagacaa catcctgccc ctcgtcatct 2160
gcagcgggat ctccaaccag ccgctggaat tcacgttcca gaaaagagag ctgcctcaga 2220
tgatggacga ggtgggtcgc attctctgta acctgtgcgg tgtggttcct ggaggggtgg 2280
tctgtttctt cccctcctac gagtacctgc gccaggtcca tgcccactgg gagaagggtg 2340
gcctgctggg ccgtctggct gccaggaaga agatattcca ggaacctaag agcgcacacc 2400
aggtggagca ggtgctgctg gcatattcca ggtgcatcca ggcctgtggc caggagagag 2460
gccaggtgac aggggccctg ctcctctctg tggttggagg aaagatgagt gaagggatca 2520
acttctctga caacctaggc cggtctgcag agctgcagga gaagatggcc tacttggatc 2580
aaaccctccc cagagccccc ggccaggcac ccccagggaa ggctctggtg gagaacctgt 2640
gcatgaaggc cgtcaaccag tccataggca gggccatcag gcaccagaag gattttgcca 2700
gcgtagtgct cctggaccag cgatatgccc ggccccctgt cctggccaag ctgccggcct 2760
ggatccgagc ccgtgtggag gtcaaagcta cctttggccc cgccattgct gctgtgcaga 2820
agtttcaccg ggagaagtcg gcctcttcct gatgggcaac cacaccactg cctggcgccg 2880
tgcccttcct ttgtcctgcc cgctggagac agtgtttgtc gtgggcgtgg tctgcgggga 2940
tcctgttaca aaggtgaaac ccaggaggag agtgtggagt ccagagtgct gccaggaccc 3000
aggcacaggc gttagctccc gtaggagaaa atgggggaat cctgaatgaa cagtgggtcc 3060
tggctgtcct tggggcattc cagggcagct cccctcctgg aatagaatct ttctttcca 3119

<210> 73
<211> 1881
<212> DNA
<213> Homo sapiens

<220>

<221> misc_feature
<223> Incyte ID No: 8096611CB1

<400> 73

ggggcaggtc ccggacccga caagctgact tggcgactcc tcgcagcccc tgctctggca 60
cgccacacct cagacgaggc tttgaaattt gttaaataca agtaaggaac acaggcaaca 120
tgttctacac acatgtgctt atgagtaaac gagggccatt ggccaaaata tggcttgcag 180
ctcactggga gaagaaactc acaaaggccc atgtatttga atgtaatcta gagataacca 240
ttgaaaaaat tctttcaccc aaggtgaaaa tagcacttcg aacttcagga caccttcttt 300
tgggagttgt tcgaatctat aacaggaagg caaaatatct tttggcagat tgcagtgaag 360
catttcttaa aatgaagatg acatttcgcc caggactggt tgaccttcca aaagagaatt 420
ttgaagcatc ttacaatgct atcacattgc cagaagaatt tcatgatttt gacacccaaa 480
atatgaatgc tattgatgtt tcagaacact ttactcagaa ccaaagcaga ccagaagaaa 540
tcactcttag agaaaatttg gacaatgatc tacttttcca agctgagagc tttggggagg 600
aatctgaaat tctcagaaga catagcttct ttgatgacaa catattactg aattccagtg 660
gtcctttaat tgaacatagt tctggaagcc tcactggaga acgatctcta ttctatgaca 720
gtggagatgg gtttggagat gaaggagctg caggagaaat gattgacaat ctattgcaag 780
atgatcagaa tatcctgtta gaagacatgc atttgaacag agaaatttcc ctgccttctg 840
agcctcccaa tagtttagca gttgaaccag ataactcaga gtgtatatgt gtacctgaaa 900
atgaaaaaat gaatgaaaca atattattat caactgaaga ggaaggattt acccttgatc 960
caattgatat ttcagacatt gctgagaaaa ggaaaggcaa aaagaggaga ttgctcatag 1020
atcctatcaa ggagctcagt agcaaagtta tacataaaca gcttacttcc tttgcggaca 1080
cactcatggt tttggaactt gcacctccta cccaaagatt gatgatgtgg aagaagaggg 1140
gaggagtgca tacacttctg tcaactgctg cccaggattt gattcatgct gaactgaaaa 1200
tgttgtttac aaaatgcttt ctgtcctctg gctttaaact tggaagaaaa atgatacaga 1260
aggagtcagt aagggaagaa gtgggaaacc aaaatatagt agagacatcc atgatgcaag 1320
agccaaatta ccagcaagag ttaagtaaac cccaaacttg gaaggatgtg attggtggat 1380
ctcagcatag ctctcatgag gataccaata aaaatattaa ctctgaggat attgttgaaa 1440
tggtgtcttt agctgctgag gaatcatctt taatgaatga cttatttgca caagaaattg 1500
aatatagtcc agttgaattg gagtcattga gcaatgaaga aaatattgag acagagagat 1560
ggaatggaag aatacttcag atgttaaatc gtttacggga atctaacaag atgggaatgc 1620
agtcctttag tctgatgaag ctctgtagaa atagtgaccg aaaacaagca gctgccaaat 1680
tttatagctt tcttgtccta aagaaacagc tggctattga gctgagccag agtgctccct 1740
atgcagatat tatagctacg atgggaccaa tgttttataa catatgaagg aaacccagac 1800
atacagattt atggcatcac tggaatttct gtgtagattg ttcaatttaa tgcagaagcc 1860
atctggaagc agctgaaggt c 1881

<210> 74
<211> 1970 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 8174603CB1 

<400> 74 

gcggccatgg cttcggcttc tcagggcgct gacgacgacg gcagtcgtag gaagccccgc 60 
ctggctgcat cgttgcagat cagcccccag ccccgcccct ggcgaccgct acccgcccag 120 
gcccaaagtg cctgggagcc ctgcgcccgc tgcccgcgac gtatcgctgc ggagggcggg 180 
tggccccagg tctcggtgtt gagggacagt gggcctgggg caggggccgg agtcggggaa 240 
ctgggggcag cccgggcgtg ggaaaacttg ggggaacaga tggggaaggc cccgagggtc 300 
cctgtgcccc cagcagggct cagcctgccg ctcaaagacc cacctgccag ccaggccgtg 360 
tccttgctca cggagtacgc ggccagcctg ggcatcttcc tgctcttccg ggaggaccag 420 
ccaccaggtg aggccggtcc ctgcttcccc ttctcggtga gcgcggaact ggatggggtg 480 
gtctgccctg cgggcactgc gaatagcaag acggaggcca aacagcaggc aacgctctct 540 
gccctctgct acatccggag tcagctggag aacccagtgg gcccccttct acctgcagtc 600 
tctcgccctg gcgcagagaa catcctgacc catgagcagc gctgcgcagc gttggtgagc 660 
gccggctttg acctcctgtt ggacgagcgc tcgccatact gggcctgtaa ggggactgtg 720 
gctggagtca tcctggagag ggagatcccg cgtgccaggg gccacgtgaa ggagatctac 780 
aagctggtgg ctctgggcac cggcagcagc tgctgtgctg gctggctgga gttctcgggc 840 
cagcagctcc acgactgcca tggcctggtc atcgcccgca gggccctgct gaggttcttg 900 
ttccggcagc tcctgctggc cacacagggg ggccccaagg gcaaggagca gtccgtgctg 960 
gccccccagc cagggcccgg acccccattc accctcaagc cccgcgtctt cctgcacctc 1020 
tacatcagca acacccccaa gggcgcggcc cgtgacatct acctgccccc cacctcggaa 1080 
ggtggcctcc cgcacagccc acccatgcgc ctgcaggccc atgtgctcgg gcagctgaag 1140 
cctgtgtgct acgtggcgcc ctcgctctgt gacacccacg tgggctgcct gtcagccagc 1200 
gacaagctgg cacgctgggc cgtgctgggg ctgggtggtg ccctgctggc ccacctggtg 1260 
tccccactct acagcaccag cctcatcctg gctgactcat gccacgaccc tccgactctg 1320 
agcagggcca tccacacccg gccctgcctg gacagtgtcc tggggccatg cctgccacct 1380 
ccctacgtcc ggaccgccct gcacctgttt gcagggcccc cggtggcccc ttccgaaccc 1440 
acccctgaca cctgccgtgg cctgagcctc aactggagcc tgggggaccc tggcatcgag 1500 
gttgtggatg tggccaccgg gcgtgtgaag gccaatgccg ccctggggcc tccctcccgt 1560 
ctctgcaagg cctcctttct ccgggccttt caccaggcgg ccagggctgt ggggaagccc 1620 
tacctcctgg ccttgaagac ctacgaggct gccaaggctg ggccctacca ggaggctcgc 1680 
cggcagctgt ctctcctcct ggaccagcag ggcctggggg cttggccctc gaagccactg 1740 
gtgggcaaat tcagaaactg aagccagcct cggcgggacc gaggtcccgg agccaagctg 1800 
tacccctgct gggggagtgc cctatctgag gagcgttgtg gggagaacat gggttgtgcg 1860 
gggtgaaact ggggctggag ggaaggagga gcctgctgtg gttgggaggc ggctgctgca 1920 
cgtttgggct tgaataaaga agtttttctg gttaaaaaaa aaaaaaaaaa 1970 

<210> 75 
<211> 2968 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 3101042CB1 

<400> 75 

gtatcagcac acacctagta tagattaatc tgtgcctatt tctggccata ttttataatt 60 
tgattaatac tgtcctttgg gaatccctga tccaatagga ttagtgtcct tatgagagcc 120 
tcaacagagc ttgctctcat tctccctctg ccacgtaagg acacagtgag agcagagctg 180
tctgcaagcc aagagaagaa acctgaatga aacctatctt ggtggtactt gatcttagac 240
tttccaacct ccaggacagg aagaaataag tttattttgt ttaatctaat cagactgtgg 300
tgttttatta aggcagccca aacagactaa gaccagtcat ttgtgtggat gattgacgtg 360
gcatgtggtt gatgttttag gaaagggtga cattcaagga tgtggctgtt gtcttcacca 420
aggaagagct ggcactattg gataaagccc agataaacct gtaccaagat gtgatgctgg 480
aaaacttcag gaacctcatg ttagtgagag acgggattaa aaacaacatt ttgaatcttc 540
aggcaaaggg gttaagttac ctttcgcaag aagtgcttca ttgctggcag atttggaaac 600
aaaggatccg ggatttaact gtgagtcagg attatatcgt gaaccttcaa gaagagtgtt 660
ccccacattt agaagatgtt tccctcagtg aagagtgggc aggcatttct cttcagattt 720
ctgaaaatga aaactatgta gtaaatgcca ttatcaaaaa tcaagatatc acagcatggc 780
aaagcctgac acaggttctt accccagaat cgtggaggaa agccaacata atgaccgagc 840
cccagaactc tcagggaaga tataagggaa tttacatgga agagaaattg tacagacgtg 900
ctcagcatga tgacagcctc agttggacct catgtgatca tcatgagtcc caagaatgta 960
aaggagagga ccctggtaga catcccagct gtgggaaaaa cttgggtatg aaatcaacgg 1020
ttgaaaaacg taatgcggcc catgtattac cacagccttt cccatgtaat aactgtgggg 1080
tggcctttgc agatgataca gatcctcatg tccatcacag cactcaccta ggagaaaaat 1140
cttataaatg tgaccagtat ggaaagaact ttagtcagag ccaagatctt atcgttcatt 1200
gtaaaactca ctctggcaag actccctatg aattccacga atggcctatg ggctgcaaac 1260
agagctcaga ccttcccaga tatcagaaag tctcctcagg agacaaaccc tacaaatgta 1320
aagaatgtgg caagggcttc aggcgcagct cttcccttca caaccatcat cgagtccaca 1380
caggggagat gccctacaaa tgcgatgaat gtgggaaagg gtttggattt aggtcacttc 1440
tttgtattca tcagggagta cacacaggga aaaagcccta taaatgtgaa gagtgtggga 1500
agggctttga tcagagctcc aaccttcttg tccatcagag agtccacact ggagagaagc 1560
cctacaaatg cagtgagtgt ggcaagtgct ttagttcaag ctccgttctt caagtccact 1620
ggaggtttca cacaggggag aaaccatata ggtgtggtga gtgtggaaag ggcttcagcc 1680
aatgtacaca ccttcacatt caccagagag tccacacagg ggagaaacca tacaaatgca 1740
atgtgtgtgg aaaggatttt gcgtatagct ctgttcttca cactcatcag agagttcaca 1800
ctggagaaaa accatataaa tgtgaagtgt gtggaaagtg cttcagttac agttcatatt 1860
ttcacttaca tcaaagagat cacatcagag agaaaccata taaatgtgat gagtgtggta 1920
aaggcttcag ccggaattca gatcttaatg ttcacctcag agtccacaca agagagaggc 1980
cctataagtg taaggcatgt ggtaagggct tcagtcgtaa ttcatacctc cttgcccatc 2040
agagagtgca tatagatgag acacagtaca cacactgtga gcgtggaaag gaccttctga 2100
ctcatcaaag actacatgag cagagagaaa cattataaat gtagtaagtc agggttcaat 2160
tagtaaaaca gaagccacac tgtattccag atgatagagg cttactcagc ctgtgggagg 2220
gctaggggag caaaagacag ggacactgcc atcgataatg tcagcctgca gcactgaagt 2280
gtgccaggca aggactgtgg atttcaagaa cttctgtgaa gctcccatca actgctgtat 2340
gctacaatgg caaagaggtt acttttgaca aatgggtggt ttgtggtagt ggttgtagtt 2400
agtggtcaaa tttccattaa agggtgtgtc caggaaggaa attctaattg ggatgatcat 2460
ggtaaggtct tcagttcagc caacgtcttt agattttttt ttagtcccca gtagaaaaat 2520
actctctgta tgaagaacat tcaaaagtgt gctcaaaaat gaaacccatg ctcttactat 2580
gaaagaacgt tagtacccag gttttccatg agattctcta cacaggcaag aagctccata 2640
gaagtggcat ttgaagggtg tggcagaggc agtgctgtgt ttatcacact ggttccattt 2700
ccttgcaaat aagaagtcta tttcccagta acccttgcag ttaagagtgt gcccatgtga 2760
ttgagttcta gccaatggag tgtgagcaaa agtgatataa gccactttca ggtctagcct 2820
ttacaaacat cctcaggctt ctctatccct gccaaggtga ccttggaggc tgcttattcc 2880
acgactgggt tgatagaagg tcactacttc atctgtgttg gatttctttt tagagtaaga 2940
aataaaagtt cattgtgtct caaaaaaa 2968

<210> 76
<211> 4497
<212> DNA

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 4972035CB1
<400> 76 

cctcacgtac cacaacctca cgtaacctct caccccccct ccgatacccc cagccccggg 60
atggcagcgc ccgccagcct tcccggccgg accggctgtc gccagtgccg gctttagcct 120
gtgggactag gccccgccga ggcctgaccc ccggagccgg gacccactgt gcagccagca 180
gccgggccgg gactgagggg ccgctccccc ctcgcggccc gtcgggagga cgctgtcgtc 240
ccggggactg ggggccgggg ggtggggggg agccagggcg aggaagtcgg gacccgtatt 300
gggaaaaaag cggggccggg gtcaggagcg ccccctcccc tcacctcccc cctccccccg 360
gggttggggg ggccggagca gagagcaccc agcccgggag gtggatgaat gtgggagaaa 420
atggagacca agacgatcgt gtacgacttg gacacatcag gggggctgat ggagcaaatc 480
caagctctgc tggctccccc caagacggac gaggcagaaa agcgcagtcg gaagcctgag 540
aaggagcccc ggagaagcgg cagggccacc aaccacgaca gctgcgatag ctgcaaggaa 600
ggtggagatc tcctgtgctg cgaccactgc ccggctgcct tccacctcca gtgctgtaac 660
cctccactga gtgaagaaat gttgcctcct ggagagtgga tgtgtcaccg gtgcactgtt 720
cgccgaaaga aacgagagca gaaaaaggag ctgggtcatg tcaatggact ggtggacaaa 780
tctggcaaac ggactacatc ccccagcagt gacactgact tgttggacag atcggccagc 840
aaaactgaac taaaggccat tgcccatgcc cggatcctgg aaaggagagc cagcaggcct 900
ggcacaccca catccagcgc cagcacagag actcccacct ctgagcagaa tgatgtcgac 960
gaagacatca ttgacgtgga tgaggaacca gtagcagcgg agccagacta tgtgcagccc 1020
cagctgaggc ggccctttga gctgctgatt gctgccgcca tggagcggaa ccccacccaa 1080
tttcagttgc ccaatgaact gacttgtacc actgcactac caggttctag caagaggaga 1140
agaaaggagg aaaccacagg gaaaaatgtt aagaagacac agcatgaatt agatcacaat 1200
ggtctcgttc ccttacccgt caaagtctgc ttcacgtgta acaggagttg ccgtgtggct 1260
cctctcatcc agtgtgacta ttgccctctc ctgtttcaca tggattgcct cgagccgccg 1320
ctcactgcca tgcccctggg cagatggatg tgtccgaatc acatcgaaca tgtggtgctg 1380
aaccagaaga atatgacact gagcaatcgg tgccaggtgt ttgatcgttt ccaggacacc 1440
gtttcgcagc atgtcgtcaa agtggacttc ctgaaccgaa tccacaagaa gcacccccct 1500
aaccggcgtg tgctccagtc ggtcaaaaga agaagcttga aggttcctga tgctataaaa 1560
tctcagtacc agtttccacc ccctctcatt gcacccgcgg ccattcggga cggggagctg 1620
atctgcaatg ggatccctga ggaatcacag atgcaccttt tgaactctga gcacttagcc 1680
acccaagcag agcagcaaga gtggctctgt agtgttgttg cgctccagtg cagcatattg 1740
aaacatttat ctgctaagca gatgccttcg cattgggact ctgaacagac agagaaggct 1800
gatattaagc ctgttattgt gactgacagc tcagtcacca cctccctgca aacagctgac 1860
aagacaccta caccttccca ctaccccttg tcctgcccct cagggattag cacccagaat 1920
tccctgagct gctctccacc ccaccagtcc ccagccctag aggacatcgg ctgcagttct 1980
tgtgcggaaa aatccaagaa aaccccttgt gggactgcca atgggccagt gaacacagag 2040
gtgaaagcta atggcccaca cctctacagc agccctactg attccacgga cccccggcga 2100
cttccaggcg ctaacacccc actaccaggc ctctcacacc ggcaaggctg gccccggccc 2160
ctcacgccac cagcggctgg gggccttcag aaccacaccg tcggcatcat tgtgaagaca 2220
gagaatgcca ctggccccag ctcttgcccc cagaggagtt tggttcctgt cccaagcctg 2280
cccccttcca ttcccagctc ttgtgccagc atcgagaaca ccagcacttt gcaaagaaag 2340
actgtccaat cacagatagg acctccgttg acagattcaa ggccactggg ctcaccccca 2400
aacgccaccc gggtgctcac ccccccgcaa gcagcaggag atggtatctt ggccacaaca 2460
gccaaccaac gattcagctc accagcgcca tcgtcagatg gcaaggtcag ccccggcacg 2520
ttatccatag gaagcgcttt aaccgtaccc tctttcccag ccaactctac tgccatggtg 2580
gacctcacca actcacttcg agcatttatg gatgtcaatg gagaaatcga gataaatatg 2640
ctggacgaga agctgatcaa gtttctggcc ttgcagagaa tacatcagct tttcccctcc 2700
cgggtccaac cttcaccggg cagtgtcggg acacatcagc tggcttctgg agggcaccac 2760
atagaagtgc aaagaaagga ggtacaggcc cgagctgtgt tctaccccct cttagggttg 2820
ggaggagctg tgaacatgtg ctatcgaacc ctctacatcg ggacaggagc tgacatggat 2880
gtgtgcctta caaactatgg tcactgtaac tacgtgtccg ggaaacatgc ctgcatattc 2940
tacgatgaga ataccaaaca ttatgagctg ttaaactaca gtgagcatgg gacaacggtg 3000
gacaatgtgc tgtattcatg tgacttctcg gagaagaccc cgccaacccc cccaagcagt 3060
attgttgcca aagtgcagag tgtcatcagg cgccgccggc accagaaaca ggacgaagag 3120
ccaagtgagg aggcagccat gatgagttcc caggcccagg ggccgcagcg gagaccctgc 3180






WO 03/054219 



PCT/US02/41115 



aattgcaaag ccagcagctc gagcttgatt gggggcagtS gggccggctg ggagggcaca 3240
gccttactgc accatggcag ctacatcaag ctgggctgcc tgcagtttgt cttcagcatc 3300
actgagtttg cgaccaaaca gcccaaaggc gatgccagcc tgctgcagga tggggtcttg 3360
gccgagaagc tctctctcaa gccccaccag ggccctgtgc tgcgctccaa ctctgttcct 3420
taggactggc ggctaccccg ccactggcct gtacacccac ccaagactcc tgcaatgcaa 3480
aaatgtacac aaaccaagcc cgggtgtttt ctatactcta ccagaaaccc ttcaactaca 3540
atctttgcat gaaatgaaga aaaccttttg actgtttttt aaaaatcctt tttcttttct 3600
caagttctag ggggcatttg cacatatatt tgtactcaac atttcatggg aaagcggcag 3660
acctgagctg aggaacagcg tgggcaggga gggaaagacc cagggtctgg acacttcctc 3720
caacacaaaa cccttcccca cccacctcct gctccctccc cctcgcccac cattgtaaaa 3780
taatcagaaa cttgttctat tttgtggcag tgacaatagt tttatattaa aagaaaaaat 3840
acagttttca tacagcaaaa tctatacaat atcattgttt tatttaatat aaagatcgct 3900
acccaccttc cttccatggt tcccaccctc cacgttattt tccctttctg cagcggttgc 3960
actacaggta gctactgtgt attatggaca aatgagaaat gaattctttt tctggctgtc 4020
catctatttt atttcaaata aggaaaagtg tatttggatt ttgtgtaaat acatctagtg 4080
atgacatttt ttcaatgttt ttaaaaaccg tgtacagtac tacatgtggt agagcgtttt 4140
ctcaaattgt ctattgtagc aaaaatgttt ttgtcgtaaa cctgttttgt ctcctttttt 4200
tgttctcttg ccacttctct cctcttcctc ctgcccctgg ttccctcctc tcctcccacc 4260
cccacaacca gtaccaatgt acatagtaat tgtaatgttt tagactttac agaaactttc 4320
ctgtattctg tctgtacttt aattacagat agatataata gcatatgctt caaaaataag 4380
agcggatttc caccttataa aaggaacatt ttcatattag ctgtattttt gaaagaaatt 4440
tcacactgtg gcacaaacaa gttctttttt catccccggg tgtggattcc cccaggg 4497



<210> 77 

<211> 513 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7506265CB1 

<400> 77 

cccacgcgtc cgcccacgcg tccgcaggag gcctacacgc cgccgcttgt gctgcagcca 60
tgtctctagt gatccctgaa aagttccagc atattttgcg agtactcaac accaacatcg 120
atgggcggcg gaaaatagcc tttgccatca ctgccattaa ggtggaacgt gtgatcacca 180
ttatgcagaa tccacgccag tacaagatcc cagactggtt cttgaacaga cagaaggatg 240
taaaggatgg aaaatacagc caggtcctag ccaatggtct ggacaacaag ctccgtgaag 300
acctggagcg actgaagaag attcgggccc atagagggct gcgtcacttc tggggccttc 360
gtgtccgagg ccagcacacc aagaccactg gccgccgtgg ccgcaccgtg ggtgtgtcca 420
agaagaaata agtctgtagg ccttgtctgt taataaatag tttatatacc aaaaaaaaaa 480
aaaaagggcg gccgctcgcg atctagaact agt 513



<210> 78 
<211> 509 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7506304CB1 



<400> 78 

caactcctgt cgtgcccgcc cggagaggaa gcggaaatgc gtgtacggta ttaaaggcgt 60
cgcctcgccc ctcgcctttc ctctttccgt ctcaggtcgc cgctgcgaag ggagccgccg 120
ccatgtctgc gcatctgcaa tggatggtcg tgcggaactg ctccagtttc ctgatcaaga 180
ggaataagca gacctacagc actgagccca ataacttgaa ggcccgcaat tccttccgct 240
acaacggact gattcaccgc aagactgcca cctcctatgt gcggaccacc atcaacaaga 300
atgctcgcgc cacgctcagc agcatcagac acatgatccg caagaacaag taccgccccg 360
acctgcgcat ggcagccatc cgcagggcca gcgccatcct gcgcagccag aagcctgtga 420
tggtgaagag gaagcggacc cgccccacca agagctcctg agccccctgc ccccagagca 480
ataaagtcag ctggctttct caccaaaaa 509

<210> 79
<211> 5881
<212> DNA

<213> Homo sapiens



<220> 

<221> misc_feature

<223> Incyte ID No: 7506198CB1 

<400> 79 

ggagggcagg cggcggtggc agccatgttg ttgtttgatc tgtggatgaa atgaatcatg 60 
attttcaagc tcttgcatta gaatctcggg gaatgggaga gcttttgcct accaaaaagt 120 
tttgggaacc tgatgattca acaaaagatg gacaaaaagg catatttctt ggggatgatg 180 
aatggagaga gactgcatgg ggagcttctc accattcaat gtcccagcct attatggtac 240 
agagaagatc tggacagggt tttcatggaa acagtgaagt aaatgcaata ctgtctccgc 300 
gatcagaaag tggaggcctt ggtgtgagca tggtagaata tgtattaagt tcttctcctg 360 
ctgataaatt ggattctcga tttaggaagg gaaattttgg cactagagat gctgaaacag 420 
atggacctga gaaaggagat caaaaaggca aggcttctcc atttgaggag gaccaaaaca 480 
gagatcttaa acaaggagat gatgatgatt ctaaaataaa tggcagaggt ttgccaaatg 540 
gaatggatgc cgattgcaaa gattttaatc gtactcctgg aagtcgtcaa gcctctccaa 600 
ctgaagtagt tgagcgcttg ggccccaata ctaatccctc agaaggactg gggcctcttc 660 
ctaatcctac agctaataaa ccacttgttg aagaattttc aaatcctgaa actcagaatc 720 
tggatgccat ggaacaagtt ggtctggaat ccttacagtt tgactatcct ggtaatcagg 780 
taccaatgga ctcttcagga gctactgtag gcctttttga ctacaattcc cagcagcagc 840 
tctttcagag gactaatgca ctaacagttc aacagttaac tgcagctcaa cagcagcaat 900 
atgcattagc agcagctcag cagccacata tagctggtgt attctcagca ggccttgctc 960 
cagctgcatt tgtgccaaat ccatacatta ttagtgctgc tcctccaggg accgatccgt 1020 
atactgcagc aggattggct gcagcagcta cattagcagg tccagcagtg gttccacctc 1080 
agtattacgg cgttccatgg ggggtgtatc cagccaactt atttcagcag caagctgcag 1140 
ctgcggcaaa taacacagcc agtcagcaag cagcatcaca agctcagcct ggacagcaac 1200 
aggttctccg tgctggagca ggtcagcgtc ctcttactcc caatcagggt cagcaagggc 1260 
agcaagcaga atcacttgcg gcagctgcag cagcaaatcc aacattggct tttggtcagg 1320 
gtcttgctac tggcatgcca ggctatcaag tactagctcc aactgcctat tatgatcaga 1380 
ctggtgcctt agtggttggc cctggagcaa ggactggcct tggagctcca gttcggttaa 1440 
tggctccaac acctgtttta attagttcag cagcagcaca agctgcagca gcagcagcag 1500 
ctggaggaac tgcaagtagc cttacaggca gcacaaatgg tctgtttcgg ccaattggca 1560 
ctcagccacc acagcagcag caacagcagc caagcactaa tctgcaatct aattcatttt 1620 
atggaagcag ttctttgact aatagctccc agagtagttc tttattttct catggacctg 1680 
gtcaacctgg aagtacatct cttggctttg gaagtggtaa ctctttgggt gctgctatag 1740 
gctcagccct cagtggattt ggttcatcag gaggactgac aaatggtagt ggtcgatata 1800 
tctctgcagc acctggagca gaagcaaaat atcgaagtgc ttcaagcact tccagtctat 1860 
ttagctccag cagccagctc tttcctcctt cccggcttcg gtataatagg tctgatatta 1920 
tgccttctgg ccgcagtaga ttattggaag atttcagaaa caaccgcttc ccaaaccttc 1980 
agcttagaga cttgattgga catatagttg agttttctca agaccagcat ggttctagat 2040 
tcatacagca aaaactagag agagctactc cagctgagcg acagatggta tttaatgaaa 2100 
ttctgcaagc agcctatcaa ttaatgactg atgtttttgg caactatgtt atacagaagt 2160 
tttttgagtt tgggagtctg gatcaaaaat tagccctggc tactcgtatt cgtggtcatg 2220 
ttctaccctt agccttgcag atgtatggct gccgcgttat tcagaaagca ttagaatcta 2280 
tttcttctga ccagcaggta attagtgaaa tggtaaagga gctggatggt catgtgctca 2340 
aatgtgtgaa agatcagaat ggaaaccatg ttgtacaaaa atgtatcgaa tgtgttcagc 2400
cacagtcact acagttcatc attgatgctt tcaagggaca agtatttgtg ctttcaactc 2460
atccttatgg ctgcagagta attcagcgca tcctagagca ttgcactgca gaacagacct 2520
tacctatctt agaagaactc caccaacata cagagcagtt ggtacaggat cagtatggca 2580
attatgttat tcagcatgta ctggaacacg gtcgacctga agacaagagc aaaattgttt 2640
ccgaaatcag gggaaaggtt ttagccctga gtcaacacaa atttgccagc aatgtagtag 2700
aaaagtgtgt tactcatgcc tcccgtgctg agagagcttt actgattgac gaggtttgct 2760
gccagaatga tggtcctcac agtgccttat acaccatgat gaaggaccag tatgccaatt 2820
acgtggttca aaagatgatt gatatggctg aacctgctca gagaaagata atcatgcaca 2880
agattcgacc tcacattact actttgcgca aatacacata cgggaagcat atactggcca 2940
agttggaaaa gtattatttg aagaatagcc cggacctagg acctattgga ggaccaccaa 3000
atggaatgct gtaaattaca ggagcaagag aaagaagata atttaaccat gtgaaaagaa 3060
ttttttttgt gtgtgaatta tcaaaacaca actcaactat gaatcttcaa ttttttttta 3120
aagcaaaact atttattgac tttattcatc catttgtaaa ttttttaagg ttcttgtgta 3180
tatttggggg gtgggggatg aattataaat tatattcagc cctgagtgga gacctatcag 3240
attggattgc tggcaaagca cagaatgcct gtatatgatg taactgtatc aaaaataaaa 3300
agctgtcaca tattttgtaa atttttacct tgtaaagtca caaaaatagt ttttaaagga 3360
aaaagtacag tattctttta ataaactggc tcacagtctg gtaggtctac aaccccatag 3420
cacaacaggt ttatagagat gtatatagaa ttatagtcct tatttttttc ctttgcgtga 3480
aaccttttat aacagattaa caatcaactg cataaatatt attaatattt taaaaagagt 3540
taagttgtat tttgataatt cacaaactat catgcaaata acgagtaagt agacaagaat 3600
aaagtggttt gagatgaaaa gaacctaaca ttatttacag tagatgtggt tttaatacaa 3660
ttactgccct aaaatgtctc tggcaatgta cagaaatatt gtatatactt acatatgtaa 3720
ttgttgtaag agttaaatac aaaatcatgg tgacacttcc aattaagtgc actaaatgaa 3780
aagttaagtc acttattaac ttttcagttt ggtttgcaat gagaaagagt ggaaatttgt 3840
attttgtttt gcttatagaa ttacagacat gttgaggaag tgttgagctt tattttgctt 3900
tttcatagag gcagaaagta ggaaccagat agagatgaaa aggggccact gaaaagtgaa 3960
tttgatagct cagcatttaa gcatgattac atattcagat agctcttttt gctttctata 4020
aatatatgca ttgtgtgtgt agtaatagat gtaagtttac actttgaaag gaaatcttgt 4080
ttcaatgttt attataaaag ccttgctaat ttagtagtga tgctttcctt ggttgtacag 4140
gtgtacattt gtaaaccttc atgctgtaaa tggaatttgt tttatctctt tgggatacat 4200
ttgcatttta gtgtacattt acgtccctgc cctctttgac ctggcaatat agtgttgtat 4260
aatgtaaatt tatttctcca aatcgagagt gattttttaa aaatttttta tctttatatg 4320
gtttcagaag tatgaaccag ctttcttttt attattgtga gatcattttg ttttataaca 4380
tagttgttga ctgttaatat ggacctgcta gaatttggat cactttcaat tgaagtcagg 4440
gtattgtgca taatagaaag tattggactg agatatttgg ttaccatgga ggccaatgct 4500
tttttcatct tattaaatgt gatgtgactt ttttctttgt acagaagagt actgtatttt 4560
tgaatagcct actcccaagt aagagcaaat ctgtatgata acattttttc ctctggacat 4620
aagacataac agtaacacga tgtacattta caagcggcct tatgtacatt tcccaacaat 4680
ctttttaagg caaaattgtg accatatgtg tataattaaa atcgttttta atcctttgcc 4740
tatgaaaata ttttggaaaa aaacttgctg tgtatattca gtttctgaaa gataaagaaa 4800
gtgctttgta ttttgttgaa gtcagtattt tgtataaaca tttatgttga cccacttatg 4860
ttcagtgctg aaaactaaaa tgaacatgct attctgtcag ctgaatatgg aagagatctt 4920
tttttactag agatctgcag aagaaacgca atcttctgag cacaatatgg aatctaaagg 4980
ttttatcact tagttgttca tattatgaac ctaaaaataa tggcataaag tttggggatg 5040
ccaggcatac tttttcatgt ttggtgttga gttattttac ttttctaacc caacattcct 5100
tggtgagacc attaaatcca aacacttgtc accgttcctt ctcatagtca ctctgggtca 5160
tcagcatgtc ccagtcactg cagcaatgcc ttgtgtttgt ttcatttttt taaaacccac 5220
acaaagccgc tgtctcactt tttcctactt taccaacctc agagtatttc ggcccgtatc 5280
gaacttttgt tctcagtatc agcccatggt ttcaggatca aagctgtcat gttggagatt 5340
ggtaatggct ttcctgtctt tgtacagttg aattcctagt cttccttcat ccttgccctc 5400
tgttggcaca ggcattatct ctgcaatttt agaaaatgac aagtagagaa tactacattg 5460
agaaactaaa ccctcttctt ggggtcctga tactcattcc catttgtccc agtgctgaca 5520
acccaatctt cccaatactt tcaggcctgc tctacaaaag tacctgttct tgtagaaatt 5580
ttacagtctg ccattttggg tgcccacccc aatttttacc ttttagtaag ttggcatgaa 5640
attttggtaa aatctgaaaa tcacatttca gaataaaaca attgggcaaa actacctagg 5700






ctttactctt gagtgtctcc ttttgatagg gattgtttct ggaccagttt gtctaagtcc 5760
tggctcttat tggttcatat gaaataatgt taacttcact tctttgtata ttatgtataa 5820
attagaaaat gaaaaatgtg tgaataacat tgtatgaaat aaacctggtc ttgtgttttt 5880

5881 



<210> 80 
<211> 5712 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1381261CB1 



<400> 80 

cagaaggaag aggcccagca gaagggactc gctgagccac cccctgcccc actccatgtt 60
ggctgtgctc tcaggctggc accatgcccc cccctgcaga ggtgacggac ccgtcccatg 120
cccccgccgt cctgcgccag ctcaatgaac agcggctccg tggcctcttc tgtgacgtca 180
ccctcatagc cggagacacc aagttccctg ctcaccgcag cgtcctggct gcttcaagtc 240
ccttcttcag agaggccctg ctcacttcag ccccactacc ccttccacca gctactgggg 300
gcgccgcacc caaccctgcc accaccacag ctgcctcttc ctcctcctcc tcttcctcgt 360
cttcttcctc ttcttcctcc tctgcttctt cttcttcttc ctcttcctct tcctctcccc 420
ctccagcctc tccccctgct tcttccccac cccgggtcct ggagttgcca ggagtcccag 480
cagctgcgtt ttctgatgtc ctcaacttca tctacagcgc ccggctcgca ctgcctggtg 540
gtggagggga cggggcagct gtagcagaga ttggagcttt ggggcgccgt ctgggcatct 600
cccgccttca gggcctgggg gagggaggtg atgcctgggt acctcctacc ccagccccca 660
tggccacctc gcagcctgaa gaggacagct ttgggcccgg gcccaggcca gctggggagt 720
gggagggtga cagggctgag gcccaggccc ctgacttgca gtgctccctg ccccggcggc 780
ccctcccctg cccccagtgt ggaaaaagct tcatccatcc caaacggctg cagacccatg 840
aggcccagtg ccgacgaggg gccagcacgc gggggtctac agggctggga gctgggggcg 900
ctggccctgg tggtcctgca ggggtggacg cctcagccct gcctccacca gtgggcttcc 960
gagggggccc cgagcacgtg gtgaaggtgg tgggcggcca cgtgctgtat gtgtgcgcgg 1020
cctgcgagcg ttcctacgtg accctgtcca gtctgaagag acacagcaat gtacactcgt 1080
ggcggaggaa gtacccctgc cgctattgtg agaaagtgtt tgctctggcg gagtaccgca 1140
cgaagcatga agtgtggcac acgggggagc gcaggtacca gtgcatcttc tgttgggaga 1200
cctttgtcac ttactataac ctgaagaccc accagcgagc cttccacggc attagcccgg 1260
gcctccttgc cagtgagaag acacccaatg gaggctacaa gcccaagctc aatacactca 1320
agctctaccg cctgctcccc atgcgggcag ccaagcggcc ctacaagacc tacagccagg 1380
gagccccgga ggctcccctt tctccaaccc tcaacacacc ggcccctgtg gcaatgccag 1440
ccagcccgcc gcctgggcct ccacctgccc cagagcctgg ccctccaccc tctgtcatca 1500
cttttgccca cccagccccc tctgtcattg tccatggggg cagtagcagt ggtggagggg 1560
ggagtgggac ggccagcaca ggagggtccc aagctgcctc ggttatcact tacactgctc 1620
ccccgaggcc acccaagaaa cgagaatacc cacctcctcc ccctgagcct gcagccacac 1680
ccaccagccc agccacagca gtcagcccag ccaccgctgc agggccagcc atggccacca 1740
ccacggagga ggccaagggc cggaatccac gggctggaag gactctgact tacacagcca 1800
agccagtggg cgggattggt ggaggtgggg gtccccccac aggggctggc cggggcccct 1860
ctcagctgca ggctccacct ccactgtgtc agatcactgt gcgaataggg gaggaggcca 1920
tcgtcaagcg ccgcatctca gagactgacc tgcgtcctgg ggagctgagc ggagaggaga 1980
tggaggagag tgaggaggac gaagaggagg aggacgaaga ggaggaggag gaggatgagg 2040
aggaatcaaa ggctggtggg gaggaccagc tctggaggcc ctactactcc tacaagccta 2100
agcgcaaggc tggagctgct ggaggtgcca gtgtgggggg cagtgggctg ccccgaggcc 2160
gccggccacc acgttggagg cagaagctgg aacggaggag ctgggaggaa accccagcgg 2220
ccgagagccc agcgggacgt gcccgcacag agcggaggca ccgatgcggg gactgtgccc 2280
agaccttcac caccctgaga aagctgcgga agcaccaaga ggcccacggt gggggctccc 2340
acagctcccg ggccggacgg aggccctcca cccgctttac ctgcccccac tgcgccaagg 2400
tgtgcaagac cgcagctgcc ctgagccgcc acgggcagag gcatgctgct gagcggcccg 2460
ggggcacccc aacccctgtc attgcctatt ccaagggcag cgctggcacc aggcccgggg 2520
atgtcaagga ggaagccccc caagagatgc aagtctcctc atccagcggt gaggcaggtg 2580
gcgggagcac tgctgctgag gaagcttccg agaccgcctc actccaggac cctatcattt 2640
cagggggtga ggagccccca gtagtggcaa gcgggggcag ctatgtatac ccacctgtgc 2700
aggaatttcc actggccttg attgggggcg gccgggaacc tggcggtggc aggggaaaat 2760
ctgggagtga agggccagtg ggggctggtg agggggaccg gatggagggg ataggggctg 2820
ccaaagtcac tttctaccct gagccctacc cgctcgtcta tggcccccag ctccttgccg 2880
cctaccctta caacttcagt aacttggccg ctctcccggt tgctctcaac atggtcctac 2940
ctgatgagaa gggtgcgggg gcccttccct tcctaccagg ggtctttggc tacgcagtga 3000
atcctcaagc agcaccccct gccccaccaa caccacctcc cccaactctt cctccaccaa 3060
ttccccctaa gggagaaggg gaaagggcag gggttgagag aacccagaag ggcgatgtgg 3120
ggtgaaccct ggggctcaat ccccctttca ccagatgcca ccctccctga accccccacc 3180
actaccagct ccctggcctc cctgcccctt gggagcccct tcacactctt gtgcagggac 3240
ttgggggccc ctggagctca ggggtcaggc tgctttgtgt gagatgtagt tttcccatct 3300
cctgggaagg gatctttcga ggttcccctc tcagtcttcc tccagggaat ggcctccatg 3360
aggggcaggg ccagcttcca tcccttctcc agcccttggg gcaactgagc aatatactta 3420
acctgaatct ctactcacag cccccaccag ctctgaatgt ctaacctgct cccctgattc 3480
gtaaacctag gggaaaccat ctctctcacc taatgacccg ccttgttctg aagctttctc 3540
taagcccttc ccagttgctt cctagcacat tccattcttt gtggcccagg gcctggacca 3600
gaccattgtg atacctgacc ccgcccacct gggagtgtgg ctttgggttt catccttccc 3660
cagcgtgggt ctctacgtcc ctgtttccct tgtatcaaga caccttcctc agcttccatg 3720
cctttggatc ttccatgttc ctccccatat tcctggactt cggagatggc ctctcccaag 3780
ccaggtcaag gaggtttggg ggagggttgc ccctctgccc ctctgttctg tggctgagca 3840
ctttcccagt ccagggcagg gaaatattgg ccctatcttg acccccaaat ccagtgagct 3900
ccagattctt ccaaggcaaa agaggtaagc agatcacacc tctttctgcc tctacatatg 3960
gcctattctg ggctagacca gatttggggg ccaggaggga agaactccat atgggatgga 4020
gaagggaatc tactttctcc ctgttttttt ttcctgatgg tttctcccag actagaccaa 4080
atagccagaa aaatgatagg ggtcggatgg gtgggtaagc ccaggatttg cacatgacct 4140
tccatcctta cctgtattcc catctcccca gtgtcactcc cctcaccaat cactccagat 4200
ggttttgggg gaaccattct actcttctgg tgggctttgg ggtatcccca ccaactttcc 4260
cttcaaaata gcaccttaca ccccatcttt gactcagttc cccacaccca aagatcccag 4320
cctagggatg gggtacaggg actttaaata gtccctaatc cctaatttgc actagttaac 4380
cctggtcagg gtccctgtat ttccttccag tgggggagat aaatgtttgc tcctaattct 4440
ctttgaaaac tgggcctccc tgctctgtga ttggataaat atttcccatc ccacccacct 4500
ccccccaaaa aatagctcac aaggggagag ccagtatggg ggagcaaatt tgacaaatgg 4560
gaattagagg agtgcagttt taaaaggaaa agttgctgtc atcaaaatgg cagccttttc 4620
cccagctact gtttttgggg ccaagatggc tgccctagca gcaatcactg ccaagggcaa 4680
gatcatggct tttggaggga ggtgagttta gggagggcca ggaccatcct cctacccctc 4740
ataccctccc agcatataca aaaggggagg ttttagacag gctccctgaa tgttaaccac 4800
agaggagtca ctccttcatt cctcctctgt ctctttgcac ttttcttggt cttggccaca 4860
gcctgagtga cgaatttcct actgaatgta ccaagttcca atttttaagg gggggaaagg 4920
tttcaaatgg ggaaaaacac acaaaaaaaa aatcactaaa aattcccaca aatcttgttt 4980
ctggcacttt agaaaaactg caaaaaaata cgtaataaag aatacatata tatatatcta 5040
cacacaaatt atatatctat ctatctatac agcggaacca caagagagac tgaggaaggc 5100
ctggaggcag gggcagaggt gacgacagtg cccctatatc cttaacccat actcctctga 5160
ggcaaacagg catgggaaaa tggaagggtt gaggatggac cggagaattg gaacttcaga 5220
ataggtcaaa attccaaaac catggacatt tttttttggg agaattgaga ttgtagacat 5280
tttttttttc ttaaatatga tcaaggaaaa tagcttccag aatgtggtgg ttctgggcaa 5340
caaatgagat tgtggcgacg tggagattaa aatatatgta tttgagctgg ggaatttgaa 5400
tattgtgagt ttcagatgtt ggaaatttgg gattttgcag ttttgtcttt tgaaaatgat 5460
caagtcttgt cagttcgtgc cctctttccc catgttccct gggaagacgg gtggtggcag 5520
agtgagaagg ccactggttc tgtgccgcag cacgcaaaat ttagaattct acagactagc 5580
tctatacgta gtgaggaccc agatttagag aaactgacca atatttatct ccgcatttgt 5640
gtgtgtgtcc aactctgtag gccaataaac caacaagaca aatgaactgt gctcaaaaaa 5700
aaaaaaaaaa aa 5712
<210> 81

<211> 1172 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 6803876CB1 



<400> 81 

gaagagccgg gtggcagctg acagaggaag ccgctccaat accttcacca taaatagtgg 60
ccatatatat atagtttaag aaggctctcc atttggcatc gtttaattta tatgttatgt 120
tctaagcaca gcttctcttc tcctattttc atcctgcaag caactccaaa tatttgaaat 180
aaagtttaca ttgtagttat tttccaatct ttgcttgata agtattaaga aatattggac 240
ttgctgccgt aatttaaagc tctgttgatt ttgtttccct ttggattttt gggggagggg 300
agcactgtgt ttatgctgga atatgaagtc tgagaccttc cggtgctggg aacgcccgct 360
gagccgccgg acggacgctc gtcttcgccc gccatggccg agagcgactg ggacacggtg 420
acggtgctgc gcaagaaggg ccctacggcc gcccaggcca aatccaagca ggctatctta 480
gcggcacaga gacgaggaga agatgtggag acttccaaga aatgggctgc tggccagaac 540
aaacaacatt ctattaccaa gaacacggcc aagctggacc gggagacaga ggagctgcac 600
catgacaggg tgaccctgga ggtgggcaag gtgatccagc aaggtcggca gagcaagggg 660
cttacgcaga aggacctggc cacgaaaatc aatgagaagc cacaggtgat cgcggactat 720
gagagcggac gggccatacc caataaccag gtgcttggca aaatcgagcg ggccattgat 780
gtgggaacca ggagtgctcg tgtgctgaga gcccagtgag gcacgggagc ggcgtggccc 840
cccttccctg ggaagatgtt gaagaggggc agcgttgggg ggaggaggga gctggtgctg 900
cagtggccag tggctgtggc tcatttcagg cctcaagctc cggggaaagg acattggaaa 960
gcccatcgag aaggggccta gggcgaaatg aacacaaagc ctcgaaatca gtgcgctcca 1020
gctgatctcg ttccgccggt tccccttggc cgccagttcc gttctcctca cgggccgaac 1080
ggaacaaggg gtccagcttg cgggggaccc tccccagccc attcctgctg tcaaacaaac 1140
aaaaccttgc aaagcgcaaa aaaaaaaaaa aa 1172



<210> 82 

<211> 3424 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7506281CB1 



<400> 82 

agtgactgcg cggcccgacc cggtgcggtg actgcggcaa gacccctggg tccccgcgcg 60
gacgcatccc agtctgggcg ggacgctcgg ccgcggcgag gcgggcaagc ctggcagggc 120
agagggagcc ccggctccga ggttgctctt cgcccccgag gatcagtctt ggccccaaag 180
cgcgacgcac aaatccacat aacctgagga ccatggatgc tgatgagggt caagacatgt 240
cccaagtttc agggaaggaa agcccccctg taagcgatac tccagatgag ggcgatgagc 300
ccatgccgat ccccgaggac ctctccacca cctcgggagg acagcaaagc tccaagagtg 360
acagagtcgt ggtcattaaa gaagaaacta atcacagtga aatggcagaa gacctgtgca 420
agataggatc agagagatct ctcgtgctgg acagactagc aagtaacgtc gccaaacgta 480
agagctctat gcctcagaaa tttcttgggg acaagggcct gtccgacacg ccctacgaca 540
gcagcgccag ctacgagaag gagaacgaaa tgatgaagtc ccacgtgatg gaccaagcca 600
tcaacaacgc catcaactac ctgggggccg agtccctgcg cccgctggtg cagacgcccc 660
cgggcggttc cgaggtggtc ccggtcatca gcccgatgta ccagctgcac aagccgctcg 720
cggagggcac cccgcgctcc aaccactcgg cccaggacag cgccgtggag aacctgctgc 780
tgctctccaa ggccaagttg gtgccctcgg agcgcgaggc gtccccgagc aacagctgcc 840
aagactccac ggacaccgag agcaacaacg aggagcagcg cagcggtctc atctacctga 900
ccaaccacat cgccccgcac gcgcgcaacg ggctgtcgct caaggaggag caccgcgcct 960
acgacctgct gcgcgccgcc tccgagaact cgcaggacgc gctccgcgtg gtcagcacca 1020 
gcggggagca gatgaaggtg tacaagtgcg aacactgccg ggtgctcttc ctggatcacg 1080 
tcatgtacac catccacatg ggctgccacg gcttccgtga tccttttgag tgcaacatgt 1140 
gcggctacca cagccaggac cggtacgagt tctcgtcgca cataacgcga ggggagcacc 1200 
gcttccacat gagctaaagc cctcccgcgc ccccacccca gaccccgagc caccccagga 1260 
aaagcacaag gactgccgcc ttctcgctcc cgccagcagc atagactgga ctggaccaga 1320 
caatgttgtg tttggatttg taactgtttt ttgttttttg tttgagttgg ttgattgggg 1380 
tttgatttgc ttttgaaaag atttttattt ttagaggcag ggctgcattg ggagcatcca 1440 
gaactgctac cttcctagat gtttccccag accgctggct gagattccct cacctgtcgc 1500 
ttcctagaat ccccttctcc aaacgattag tctaaatttt cagagagaaa tagataaaac 1560 
acgccacagc ctgggaagga gcgtgctcta ccctgtgcta agcacggggt tcgcgcacca 1620 
ggtgtctttt tccagtcccc agaagcagag agcacagccc ctgctgtgtg ggtctgcagg 1680 
tgagcagaca ggacaggtgt gccgccaccc aagtgccaag acacagcagg gccaacaacc 1740 
tgtgcccagg ccagcttcga gctacatgca tctagggcgg agaggctgca cttgtgagag 1800 
aaaatactat ttcaagtcat attctgcgta ggaaaatgaa ttgtttgggg aaagtcgtgt 1860 
ctgtcagact gccctgggtg gagggagacg ccgggctaga gcctttggga tcgtcctgga 1920 
ttcactggct ttgcggaggc tgctcagatg gcctgagcct cccgaggctt gctgccccgt 1980 
aggaggagac tgtcttcccg tgggcatatc tggggagccc tgttccccgc tttttcactc 2040 
ccataccttt aatggccccc aaaatctgtc actacaattt aaacaccagt cccgaaattt 2100 
ggatcttctt tctttttgaa tctctcaaac ggcaacattc ctcagaaacc aaagctttat 2160 
ttcaaatctc ttccttccct ggctggttcc atctagtacc agaggcctct tttcctgaag 2220 
aaatccaatc ctagccctca ttttaattat gtacatctgt ttgtagccac aagcctgaat 2280 
ttctcagtgt tggtaagttt ctttacctac cctcactata tattattctc gttttaaaac 2340 
ccataaagga gtgatttaga acagtcatta attttcaact caatgaaata tgtgaagccc 2400 
agcatctctg ttgctaacac acagagctca cctgtttgaa accaagcttt caaacatgtt 2460 
gaagctcttt actgtaaagg caagccagca tgtgtgtcca cacatacata ggatggctgg 2520 
ctctgcacct gtaggatatt ggaatgcaca gggcaattga gggactgagc cagaccttcg 2580 
gagagtaatg ccaccagatc ccctaggaaa gaggaggcaa atggcactgc aggtgagaac 2640 
cccgcccatc cgtgctatga catggaggca ctgaagcccg aggaaggtgt gtggagattc 2700 
taatcccaac aagcaagggt ctccttcaag attaatgcta tcaatcatta aggtcattac 2760 
tctcaaccac ctaggcaatg aagaatatac catttcaaat atttacagta cttgtcttca 2820 
ccaacactgt cccaaggtga aatgaagcaa cagagaggaa attgtacata agtacctcag 2880 
catttaatcc aaacaggggt tcttagtctc agcactatga cattttgggc tgactactta 2940 
tttgttaggc gggagctctc ctgtgcattg taggataatt agcagtatcc ctggtggcta 3000 
cccaatagac gccagtagca ccccgaattg acaacccaaa ctctccagac atcaccaact 3060 
gtcccctgcg aggagaaatc actcctgggg gagaaccact gacccaaatg aattctaaac 3120 
caatcaaatg tctgggaagc cctccaagaa aaaaaataga aaagcacttg aagaatattc 3180 
ccaatattcc cggtcagcag tatcaaggct gacttgtgtt catgtggagt cattataaat 3240 
tctataaatc aattattccc cttcggtctt aaaaatatat ttcctcataa acatttgagt 3300 
tttgttgaaa agatggagtt tacaaagata ccattcttga gtcatggatt tctctgctca 3360 
cagaagggtg tggcatttgg aaacgggaat aaacaaaatt gctgcaccaa tgcactgagt 3420 
gaag 3424 

<210> 83 

<211> 1882 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506175CB1 

<400> 83 

ctttccagca gagcggggcg gggcggggct gagcgtgttt acatccgccg ggtgcgcggc 60 
ttcgccgccc gaggtcgttc ggctcgggta ccatcctccg cgccatggac accagcgacc 120 
tgttcgccag ctgcaggaag ggggatgtgg gccgagtgcg gtacctgctg gagcagcgag 180

acgtggaggt gaatgtgcgg gacaagtggg acagcacccc cttgtactat gcctgcttgt 240 

gtgggcacga ggagctggta ctctaccttc tggccaatgg agcccgctgc gaggccaaca 300 

ccttcgatgg tgagcgctgc ctctatgggg cactgagtga ccccatccgc cgggctctac 360 

gcgattacaa gcaggtcacg gcttcctgca ggaggcggga ttactatgac gacttcttgc 420 

agcggcttct agagcagggc atccacagtg acgtggtctt tgtagtacac gggaagccat 480 

tccgggtgca tcgctgcgtc ctgggtgcac gtagtgccta ctttgccaac atgctggaca 540 

ccaaatggaa gggcaagagt gtcgtggttc tcaggcaccc actgatcaac cccgtggcct 600 

ttggggccct gctgcagtac ctgtacacag tggcgtctaa gccaggcacg tgtgtgaagg 660 

tgctgaccat cgagccccca cctgcagacc cccgcctccg ggaggacatg gcgctgctgg 720 

ccgattgtgc cctgcccccc gagctccgag gtgatctttg ggagctgccc ttcccttgtc 780 

ctgacggctt caacagctgc cctgacatct gcttccgagt ggctggctgc agcttcctct 840 

gccacaaggc ctttttctgt ggccgcagtg actacttccg agccctgctg gatgaccact 900 

tccgagagag cgaggagcca gcgacctcag ggggcccccc agccgtcacc ctgcatggca 960 

tctcacccga cgtcttcact cacgtgctct actacatgta cagcgaccac actgagctgt 1020 

cccccgaggc agcctatgat gtgctgagcg tcgccgacat gtacctgctg ccaggcctga 1080 

agaggctgtg cggccgcagc ctggctcaga tgctagacga ggacactgtg gtgggtgtgt 1140 

ggcgcgtggc caagctcttc cgcctggcgc ggcttgagga ccagtgcact gagtacatgg 1200 

ccaaggtcat tgagaagctg gtggagcggg aggacttcgt ggaggcggtg aaggaggagg 1260 

cagcggctgt ggcagcccgg caggagacgg actctatccc gctggtggac gacatccgct 1320 

tccacgtggc cagcacggtg cagacctaca gcgccataga ggaggcgcag cagcgtctgc 1380 

gggcactcga ggacctgctc gtgtccatcg gtctggactg ttgagcccct ggctgggcag 1440 

ccccaggggc caggagctct cttggagaca agcatgtgta tgcgtttgtg tgcagctctt 1500 

cttcctgctc cctgcacatt gagggcttca tggggggtgc gaggggctca gtggggcttc 1560 

tcttccctcc atgagcctgg agaccccagg ggaggatcca tttgggatga gccccctccc 1620 

cccaatgcac aagccagccc ccaagaccct gggggtggac accactcagg gaaacctggg 1680 

gtgggggtgg gctttggtct tagcactttc cttctccaga tcccccctac ccaccccagt 1740 

cccaaatcca gtcctctggc ccttgcctag ccctgaattg cttctctaag ctggtgttcc 1800 

catgcacagg gccattcagg aagggctggg ggagtgtgtg tggcaataaa gcttgaaggc 1860 
accgtgggag caaaaaaaaa aa 1882 

<210> 84 

<211> 2223 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506303CB1 



<400> 84 

gtgcagcctg atgcgcaacg tggggactca ggcgcgctgg gcggcaggag ttgcttccgg 60 

ccgtgttggt ggtctgaatt gagaagccgc gactaaggga agatggagac aatactggag 120 

cagcagcggc gctatcatga ggagaaggaa cggctcatgg acgtcatggc taaagagatg 180 

ctcaccaaga agtccacgct ccgggaccag atcaattctg atcaccgcac tcgggccatg 240 

caagatatct gtgtgccaat gtcagtggaa tttgaggaac tcctgaaggc tcgagagaat 300 

ccaagtgaag aggcacaaaa cttggtggag ttcacagatg aagagggata tggtcgttat 360 

ctcgatctcc atgactgtta cctcaagtac attaacctga aggcatctga gaagctggat 420 

tatatcacat acctgtccat ctttgaccaa ttatttgaca ttcctaaaga aaggaagaat 480 

gcagagtata agagatacct agagatgctg cttgagtacc ttcaggatta cacagataga 540 

gtgaagcctc tccaagatca gaatgaactt tttgggaaga ttcaggctga gtttgagaag 600 

aaatgggaga atgggacctt tcctggatgg ccgaaagaga caagcagtgc cctgacccat 660 

gctggagccc atcttgacct ctctgcattc tcctcctggg aggagttggc ttctctgggt 720 

ttggacagat tgaaatctgc tctcttagct ttaggcttga aatgtggcgg gaccctagaa 780 

gagcgagccc agagactatt cagtaccaaa ggaaagtccc tggagtcact tgatacctct 840 

ttgtttgcca aaaatcccaa gtcaaagggc accaagcgag acactgaaag gaacaaagac 900 
attgcttttc tagaagccca gatctatgaa tatgtagaga ttctcgggga acagcgacat 960
ctcactcatg aaaatgtaca gcgcaagcaa gccaggacag gagaagagcg agaagaagag 1020
gaagaagagc agatcagtga gagtgagagt gaagatgaag agaacgagat catttacaac 1080
cccaaaaacc tgccacttgg ctgggatggc aaacctattc cctactggct gtataagctt 1140
catggcctaa atatcaacta caactgtgag atttgtggaa actacaccta ccgagggccc 1200
aaagccttcc agcgacactt tgctgaatgg cgtcatgctc atggcatgag gtgtttgggc 1260
atcccaaaca ctgctcactt tgctaatgtg acacagattg aagatgctgt ctccttgtgg 1320
gccaaactga aattgcagaa ggcttcagaa cgatggcagc ctgacactga ggaagaatat 1380
gaagactcaa gtgggaatgt tgtgaataag aagacatacg aggatctgaa aagacaagga 1440
ctgctctagt gttgagggat gtagctcagc ttttgggcta gcccaggctt ccctaagatc 1500
tgctttttct atttctccca accaaatcct cttaaagacc ctttgctatg tagtctcatg 1560
gtctagcatg catcttgtag aaacaaggca tgctggcaga ttgcagggtt gagatgtgtt 1620
ttatctgttt tatattttaa aagattctgc cagaaaataa aaccagacct tgttctaaag 1680
cccagggtta tggaccaact cagtgcttca ggtcttaatg cctccatacc tcttcctcac 1740
caactttact agtagctgag atttaatggg cacctattat gctacatatc atgttaggta 1800
aatctgacct gacctctttc cccaccctcc tttgttgctg cttccctgaa tgagtattac 1860
cccaggatga ggtctgccat cagcttagtt agccattgat gcaaatacta gggaaagact 1920
aggaggatga gccagggttg ctactaagga ctaagtgtcg caccaaggtt tgccttttgt 1980
atttgcataa agaaaggagt tggagctggg tgcagtggct tgtgcctgta gtcccagcta 2040
cttgggaggc tgaggcagga gggttgcttg agactagcct aggtaacata gtgagaccct 2100
gtctcattaa aaaaaaaagg catggtggca cgcactgtag tcccagctac tcaggagact 2160
gaggctagaa gatcctttga acctaggagt ttgagaccag cctgggcgat atagtgaggc 2220
ccc 2223



<210> 85 
<211> 956 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7353336CB1 



<400> 85 

agcggaatga acggcggcgg gaggtgaaat ccggttctaa ccggtccggg gctcccagcg 60
ctataaaaac tttataaacc ccccggagcc cgagcagtgt gaagaagagg cgagaacgac 120
ccccggaccg accaaagccc gcgcgccgct gcatcccgcg tccagcacct acgtcccgct 180
gccgtcgccg ccgccaccat gcccaagaga aaggctgaag gggatgctaa gggagataaa 240
gcaaaggtga aggacgaacc acagagaaga tccgcgaggt tgtctgctaa acctgctcct 300
ccaaagccag agcccaagcc taaaaaggcc cctgcaaaga agggagagaa ggtacccaaa 360
gggaaaaagg gaaaagctga tgctggcaag gaggggaata accctgcaga aaatggagat 420
gccaaaacag accaggcaca gaaagctgaa ggtgctggag atgccaaaac agaccaggca 480
cagaaagctg aaggtgctgg agatgccaag tgaagtgtgt gcatttttga taactgtgta 540
cttctggtga ctgtacagtt tgaaatacta ttttttatca agttttataa aaatgcagaa 600
ttttgtttta cttttttttt ttttttaaaa gctatgtggt tagcacacag aacacttcat 660
ggtggttttg gggggaaggg gcatatgtca ctaatagaat gtctccaaag ctggattgat 720
gtggagaaaa cacctttccc ttctagtttg gagagacttc ctctgggctc ccaggaggag 780
ggattccctg actttgacac acatggccac ctgtggcaca aaagcctggg ggtatagaaa 840
aacaaattgg tgtttatgtc ctcttctccc tttcatcttt cagcatagac ttaatccctt 900
taagccccag actctggttg agacctgacc ctagtcatgg gttccccgtg tgtcgg 956



<210> 86 

<211> 2633 

<212> DNA 

<213> Homo sapiens 




<220> 

<221> misc_feature 

<223> Incyte ID No: 3001652CB1 

<400> 86 

tgaggaaaga caggaaagcg ccagcatctc caccttcccc ggaagcctcc ctttgccagg 60
cagaaagggt ttcccatggg gccgcccctg gcgccgcgcc cggcccacgt acccggggag 120
gccgggcccc ggaggacgag ggaaagcagg ccgggcgccg tgagcttcgc ggacgtggcc 180
gtgtacttct ctcccgagga gtgggaatgc ctgcggccag cgcagagggc cctgtaccgg 240
gacgtgatgc gggagacctt cggccacctg ggcgcgctgg gattttcagt tcccaaaccc 300
gcttttatct cgtgggtgga aggagaagtg gaggcgtgga gcccggaggc ccaggatccc 360
gacggtgaga gctctgcagc tttcagcagg ggccaaggac aggaagcagg atccagggat 420
gggaatgagg agaaggaaag gctgaagaag tgtccaaaac aaaaagaggt ggcgcatgaa 480
gtggctgtca aggagtggtg gcccagcgtc gcctgcccag agttctgcaa ccctaggcag 540
agccccatga atccctggct caaggacact ctgacccgaa gactgcccca ctcttgccca 600
gactgtggcc gcaacttcag ctacccttcc ctcctggcca gccaccagcg ggtccactcc 660
ggggagcggc ccttctcctg cggccagtgt caggcgcgtt tctcccagcg caggtacctg 720
ctccagcatc agttcatcca caccggcgag aagccctacc cctgccccga ctgcgggcgc 780
cgcttccgcc agaggggttc cctggctatc cacaggcggg ctcacaccgg ggagaagcct 840
tacgcgtgct cagactgcaa gagtcgcttc acttacccct acctgctggc catccaccag 900
cgcaagcaca cgggcgagaa gccctacagc tgccccgatt gcagcctccg tttcgcctac 960
acctccctgc tggccatcca caggcgcata cacaccggcg agaagcccta cccctgtcct 1020
gactgcggcc gccgcttcac ctattcttcc ctcctcctca gtcaccggcg cattcactcc 1080
gacagccggc ccttcccctg cgtggagtgt gggaaaggct tcaagcgcaa gaccgccctg 1140
gaagcccatc ggtggatcca ccgctcctgc agcgagaggc gcgcgtggca gcaggccgtg 1200
gtggggcgtt cagagcccat ccctgttttg ggaggcaagg atcccccagt tcacttccgg 1260
cactttccag atatatttca agagttctgt caacagaggc ttcaggaccg cggggtccct 1320
tcaaatgccc cgccagtccc aggccaatca ccgcgcagct tcttccggga tcgtcgccaa 1380
tcatcggccg ttgcgtattg cgggtacaga ggggtaagtg aagcttcggg cccttacatc 1440
tttcttgaag gcaagaagcc tctcctctac ttcccagaca ccccgcctcc cccactagaa 1500
aaagcagccg aagcggcttt atttaagggc aagtgggacg atgaggccag agaaatggcg 1560
ccgcccccag ccccgctcct ggcgccgagg cccggggaga cccggcctgg ttgcaggaag 1620
cccgggactg tgagcttcgc ggacgtggcc gtgtacttct ccccggagga gtggggctgt 1680
ctgcggcccg cgcagagggc tctgtaccgg gacgtgatgc aggagaccta cggccacctg 1740
ggcgcgctcg gattcccagg ccccaaacca gccctcatct cttggatgga acaggagagt 1800
gaggcttgga gccccgccgc ccaggatcct gagaaggggg aaagactggg aggagctcgg 1860
agaggagatg tcccaaacag gaaggaagag gaaccggagg aagtcccaag agccaaaggg 1920
cctagaaagg ctcctgtgaa ggagagtcct gaagtgctgg tggaacgcaa ccctgaccca 1980
gctattagcg tggccccggc acgggcacag ccacccaaaa atgctgcctg ggacccgacc 2040
acaggagcac agcccccggc acccataccc agcatggatg ctcaggccgg ccagcggcgc 2100
cacgtgtgca cggactgcgg ccgccgcttc acctacccct cactgctggt cagccacagg 2160
cgcatgcact cgggggagcg gcctttcccc tgccccgagt gtggcatgcg cttcaagagg 2220
aagttcgcag tggaagcgca ccagtggatc caccgctcct gctccggggg gcggcggggc 2280
cggaggcctg ggatccgggc tgtgcctcgg gcccccgtcc gaggtgaccg ggacccgcct 2340
gtgctcttcc ggcactaccc agacatcttc gaggagtgcg gctgagcggc accgcaggct 2400
ggagttgagc ctgaccttgg cacgaaggac tgacggatcc ctgaggtggg ccactgagtc 2460
ggggactccg gaactgaaat tcatgccctg ggctttcctc aaggatccct caagtttcca 2520
acttgtaaaa agaaaagtgc ctgtaaagat tcgaatagat tagacttgcc acccatctcc 2580
ccagtctttt gtttaacaaa aaaaaaaaaa aagggccgct cgcgatctag aac 2633

<210> 87
<211> 4521
<212> DNA

<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 1689128CB1 

<400> 87 

gcatctagaa ccacacaccg gagaaggggt ttccccctgc ccgcccgccc gccccccaac 60 

tacctccccg ctcctgccag tgtctgcctc tctgccccca cgggggttta tttgatctca 120 

cattaaccaa ggaggagaat gtgagaaaag ggggaaaatg cccaggagga agcaggagca 180 

gcccaagcgc cttccctcac atgttagtag gcaggaagag gcggagggag agctcagtga 240 

aggagaacac tggtatggaa actcttcaga gactccgtca gaagcatcct atggggaagt 300 

ccaggagaac tataagttgt ctctggagga ccggatccag gagcagtcca cgtccccgga 360 

cacctccttg ggaagcacga ctcccagcag ccacacgttg gagctggtgg cactggacag 420 

tgaggtcctg cgagactcac tgcagtgtca agatcacctt tccccgggag tgtcttcttt 480 

gtgtgacgat gacccaggct ccaacaagcc cctgagcagc aatttgaggc ggctgctgga 540 

ggctggttcc ctcaaactcg atgctgcagc cacggctaat ggcagagtgg agtcccccgt 600 

aaacgttggc tcgaacctct ccttttcccc gccttcccac cacgcccagc agctcagtgt 660 

tctggccagg aagttggccg agaagcagga acagaatgac caatacactc caagtaaccg 720 

ttttatatgg aatcaaggta agtggttgcc aaactcaacc accacctgca gcttgtctcc 780 

ggattcagcc atcctaaaac tgaaagctgc agccaatgcc gttctgcagg acaaatctct 840 

caccaggact gaggagacca tgcgatttga gtccttttcc tcccctttta gctcccagtc 900 

tgccagttct accttggccg ctttgtccaa gaaggtcagt gaaagaagtt tgactcctgg 960 

tcaggagcac cctccaccgg ccagctcctt cctttccctg gcttctatga cctcctcagc 1020 

ggcccttctg aaggaggtgg ccgcaagggc tgcgggcagt cttctggctg agaaatcatc 1080 

gctgctgcct gaggaccctc taccgccccc gccttcagag aagaaaccag aaaaagtcac 1140 

tccgccacct ccaccgccac ctccaccacc tccaccacca ccaccacaat ccctggaatt 1200 

attattactc ccagttccta agggaagagt ttctaaaccc tccaattcag cctcagaaga 1260 

ggaaagtgga aagcctttcc agtgtccaat atgtggtttg gttattaaaa ggaagagtta 1320 

ctggaagcgg cacatggtga ttcacacagg tttaaaaagt catcagtgtc cgctctgtcc 1380 

attccggtgt gctcgcaagg acaatctcaa atcccacatg aaggttcatc agcaccagga 1440 

tcggggagag acctttcagt gccagctgtg cccttttact tcctcacgcc acttcagcct 1500 

gaaactccac atgcgttgcc atcagcactt cctgaggaca gaagccaagg tgaaggagga 1560 

gatcccagac ccagatgtca agggatctcc ccacctcagt gacagtgcct gcctggggca 1620 

gcaaagggaa ggaggaggga cagagctagt ggggaccatg atgacgtcta acactccaga 1680 

gaggactagc cagggagggg ctggcgtctc gcctttgctg gtgaaggagg aacccaagga 1740 

agataacggc ctgcccacct cctttacttt gaatgctgcc gacaggcccg ccaaccacac 1800 

aaagctgaaa gacccctccg agtatgtggc caacagtgcg tcagcattgt tcagccagga 1860 

catctctgtt aagatggcgt ctgattttct catgaagctg tcagctgcaa atcagaagga 1920 

gcccatgaat cttaatttta aagtgaaaga ggagcctaag gaaggggagt ccctaagcac 1980 

gactttgcct cggtccagct atgtgttcag cccggaatct gaagtgtcag ccccaggcgt 2040 

ctctgaggac gcactaaagc cccaggaagg gaagggaagt gtgctaaggc gggatgtgtc 2100 

agtcaaagca gcctctgaac ttctcatgaa actctcagcg gaaagctaca aggaaacaca 2160 

gatggtgaag attaaagagg aacccatgga ggttgacatc caggactccc atgtctcgat 2220 

atcacccagc cggaatgttg gctacagcac tttaatcggg cgagagaaaa ccgaaccctt 2280 

acagaagatg ccagagggca gagtaccccc agagagaaac ctcttcagtc aggatatctc 2340 

tgtgaaaatg gcttccgagc tcctctttca actgtcagaa aaagtgagca aagagcacaa 2400 

tcatacaaaa gaaaacacca tccggaccac gaccagccct ttcttttcag aagacacatt 2460 

tagacaatca ccattcacct ccaattcaaa agaactgctg cccagtgact ccgtgctgca 2520 

cggaagaata tcagctccag aaacagaaaa gatagtccta gaggcaggaa atggattacc 2580 

atcctggaaa ttcaatgacc agctttttcc ctgtgacgtg tgtgggaaag tgtttggccg 2640 

acagcagaca ttgtcccgac acctctcgct gcacacagag gaaagaaaat acaaatgcca 2700 

cttgtgcccc tatgctgcta agtgccgtgc aaatctgaac cagcacttga ccgtccattc 2760 

cgtgaagctg gtgagtacag acaccgagga cattgtcagc gccgtcacct ctgaaggcag 2820 

tgatgggaag aaacatcctt attattacag ttgtcacgtg tgtggatttg agaccgagct 2880 

caatgtccag tttgtcagcc acatgtcact ccacgtggac aaggagcagt ggatgttttc 2940 

gatctgctgc actgcctgcg acttcgtcac catggaggaa gcagagataa agactcacat 3000 

tggcaccaag cacacagggg aagacaggaa gacccccagc gaatcaaata gcccctcttc 3060 

atcctccctc tcagctctga gtgattcagc caacagcaaa gatgattcag atggctccca 3120 
gaaaaacaag ggcgggaaca atctgctggt catctctgtc atgcctggga gccagccctc 3180
actgaacagt gaggaaaagc cagagaaagg gttcgaatgt gttttttgca actttgtctg 3240
caagacgaag aacatgtttg agcgtcatct gcagatacac ctcatcaccc ggatgtttga 3300
gtgtgatgtg tgccacaagt tcatgaagac ccccgaacag ctgctggagc ataagaaatg 3360
ccacactgtc cccaccggtg ggctcaactc aggacagtgg tgagtttcag actcctctag 3420
gtgcccattc tgcatttatt ccaccaaccg ccccgctgcc atggagtgcc acctcaagac 3480
ccactacaag atggagtaca agtgccggat ctgccagacg gtgaaggcca accagctgga 3540
gctggagacg cacacccggg agcaccgcct gggcaaccac tacaagtgcg accagtgcgg 3600
ctacctgtcc aagaccgcca acaagctcat cgagcacgtg cgcgtccaca ccggggagcg 3660
gcccttccac tgtgaccagt gcagctacag ctgcaagcgc aaggacaatc tcaacctgca 3720
caagaagctg aagcacgccc cacgccagac cttcagctgc gaagagtgcc tgttcaagac 3780
cacacaccct ttcgtcttca gccgccacgt caagaagcac cagagtgggg actgccccga 3840
ggaggacaag aagggcctgt gtccagcccc caaggaaccg gccggcccgg gggccccgct 3900
cctggtggtc gggagctccc ggaatctcct gtctcccctg tcagttatgt ctgcctccca 3960
ggctctgcag accgtggccc tgtcggcagc ccacggcagc agctcagagc ccaacctggc 4020
actcaaggct ttggccttca acggctcccc tttgcgcttt gacaagtacc ggaactcaga 4080
ttttgcccat ctcattccct tgacaatgtt ataccccaag aaccacttgg atctcacatt 4140
ccaccctccc cgacctcaga ctgcgcctcc cagcatcccc tcacccaaac actccttcct 4200
ggcctatctc ggactgagag aaagagcaga gactgtctga gggcagccat gttctgtacc 4260
aaaaacagag agacaaaaga caaaaaaaaa aaaaaaaacc acaaaactta aacacaaccc 4320
cagcaggtgt atgttgctgc aaaacctaca gaccccgatg ggtctgtgaa catgtgtact 4380
gtaactctcc tagtaaggaa tagaaaattg gctctgtgtg tatacctatt gcattgacct 4440
gaaagcggct ttatccaatc ttcagagagg tgacctactg gatacttcta ccttcagagg 4500
catgctcccc caggcaccaa g 4521



<210> 88 
<211> 3907 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2362969CB1 



<400> 88 

ttttgcggcc cccccttttg tgtatttttt gtctttgagt gacaccagag ctgggttcac 60 
ggtgatggct ctcccataat tgtgggcttt gcttgaatgg gttttaacgg aggactcttt 120 
ggtttagcct gaacaacagt tgttgtgagg gaggccagtc cagggtattg tcccagtggt 180 
ggtcgttgta aaggttgcag tgggtatagc gattagctgc gggggagagg ggcacggggc 240 
agagggaaga gagcaaggtc tgctgctgct gccccgggct ccgaggccag cttcacagag 300 
tccagagggc tgcagaataa gaacagaggg ggggccaatg ggaaagggag gcggggcagc 360 
ctcaatgcca gcggacgaag gacaccccca aattgtgctg ctgaggatat caaagccagc 420 
ccttcctcca ccaacaaaag gaaaaacaag cctccaatgg agctggacct gaactccagc 480 
tctgaggaca ataagcctgg aaagcgtgtc cgcacaaatt ccagaagcac tcccactacc 540 
cctcaaggga aaccagagac tacttttttg gaccaaggct gctcttctcc agtgttaatc 600 
gactgtcccc acccaaactg caacaaaaag tacaagcaca ttaacggcct gaggtaccac 660 
caggctcatg cacacttaga cccagaaaac aagctggagt tcgagcctga cagtgaggac 720 
aagatctcgg actgtgagga aggattgagt aatgtggcac ttgaatgcag tgagccaagc 780
acaagtgtat ctgcttatga ccagttgaag gcaccggcat cccctggtgc tggaaaccca 840 
cctgggaccc caaagggaaa gagagagctg atgagcaatg gcccaggttc cattattggt 900 
gctaaagctg ggaagaattc tggcaaaaag aagggcctta acaatgaact gaacaacctt 960 
ccagtaatct ccaacatgac ggctgcgtta gacagttgct cggcagcaga cggcagtttg 1020 
gctgctgaga tgcctaaact ggaagcagaa ggattaattg acaagaaaaa tttaggagat 1080 
aaagaaaagg gcaaaaaagc taccaactgc aaaacggaca aaaacctctc taaactgaaa 1140 
agtgcccggc ccattgcccc tgccccagcc cccactcccc cgcagctaat cgctataccc 1200 
actgcaacct ttacaacgac caccactggg acaatacccg gactgccctc cctcacaaca 1260 
actgttgttc aggctacacc aaagagtcct ccgttaaaac ccattcaacc aaagcccaca 1320

attatgggag agcccatcac cgtgaaccca gctctggtgt cactcaaaga caaaaagaaa 1380 

aaggagaagc gaaagctaaa ggacaaagaa gggaaagaga cgggaagccc aaaaatggat 1440 

gccaagctgg ggaaactaga ggactccaag ggggccagca aagatttacc tgggcatttt 1500 

ttaaaggatc atctcaacaa gaatgaaggg ctggcaaatg gactgtcgga gtctcaggag 1560 

agccgcatgg ccagtatcaa agctgaggcc gataaggttt acactttcac agacaacgct 1620 

cccagccctt ccatagggag cgcctcgagg ctggaatgca gcactttggt gaacgggcag 1680 

gcaccaatgg cacctctgca tgtgttgacc cagaatgggg cagagagttc agctgcaaag 1740 

acaagcagcc cggcctattc agacatatct gatgctgctg acgatggtgg ttctgacagc 1800 

aggtcggagg gtatgagatc aaaggccagt tccccatcag atattatttc tagtaaggac 1860 

agtgttgtaa aagggcattc ttcaactaca gcacaatcat ctcaactgaa agagtcccat 1920 

tctccctatt accacagcta tgatccttat tattctccaa gttacatgca ccctgggcag 1980 

gtcggtgccc ctgcagctgg aaatagtggg agcacgcagg gaatgaagat caagaaggag 2040 

tctgaggaag atgctgaaaa gaaagacaag gcagagcagt tagattctaa gaaagtggac 2100 

cacaattctg catccttaca gcctcagcac cagtcggtga tcacacaaag acatcctgcc 2160 

ctggctcagt cactttatta tggccagtat gcatatgggc tctatatgga ccagaagtct 2220 

ctgatggcca ccagccctgc ctatagacag cagtatgaga agtactatga ggaccagagg 2280 

ctggcagagc agaaaatggc ccagactggg agaggagact gtgaaaggaa aagtgagctc 2340 

cccttgaaag agctgggcaa ggaggaaact aaacagaaaa atatgccatc ggccacaatc 2400 

tcaaaagctc cctctactcc ggagcctaac aaaaaccatt ctaaactagg gccatcagtg 2460 

cctaataaaa ctgaggagac aggtaaatcg cagcttctct ccaatcacca gcagcagctt 2520 

caggccgaca gcttcaaagc taagcagatg gaaaaccacc agcttattaa ggaggctgta 2580 

gaaatgaagt ctgtcatgga ctctatgaag cagacaggtg tagacccaac ctcgagattt 2640 

aaacaagacc cagattcaag aacatggcat cattatgtat accagcccaa atatctggat 2700 

cagcaaaagt cagaagaact tgatagagag aagaaattaa aagaggatag tccgaggaaa 2760 

actcctaata aagagagtgg tgtgcccagc cttcctgtat cgttaacaag cattaaagag 2820 

gagcccaaag aggccaagca tcctgattct caatcaatgg aggagagcaa gctgaaaaat 2880 

gatgatcgaa agactcctgt gaactggaag gactctcggg gaacaagagt ggctgtctcc 2940 

tcacccatga gtcagcatca gtcatacata cagtacttgc atgcttatcc ttacccacag 3000 

atgtacgacc ccagccatcc tgcataccgg gctgtttctc ccgtcctaat gcacagttat 3060 

cctggggcat atctctctcc aggatttcat tatcctgttt atgggaagat gtcagggaga 3120 

gaagagacag agaaagtcaa taccagccct agcgtcaaca cgaaaacaac cactgaatct 3180 

aaagcactgg atttgctcca gcagcatgct aaccaatacc gcagcaagtc tcctgctcct 3240 

gtggaaaagg ctacagctga gcgggaacgg gaggcagaaa gggaaaggga tcgccactcc 3300 

cccttcggcc agcggcacct gcacacgcac caccacaccc acgttggcat gggttacccg 3360 

ctaatcccgg gtcaatatga cccttttcaa ggcttgacct ctgctgccct tgttgcctct 3420 

cagcaggtgg ctgcccaggc atctgcatct ggaatgtttc ctggacaaag aagagaataa 3480 

caagattgga catgtacatt gtcatgtgac tggatcacat ggggaagcgc tttatacaga 3540 

catcagttcc atggtgttaa tttgaaatgc cttaatggca ggcaaaaacc agtcatttca 3600 

tttttgtaaa ttacagattt tatcacagag taacaaatgt tgctgtaatt aaacttctgt 3660 

tttatatata tatacgtata catatctgta tatatacata tatatatata taaatataaa 3720 

tatattcttt actttttgta tcagtaacca ggctcgcaca cacagggtct gctgccacgt 3780 

cgcaaaccac tcagtaaagg aaattaaaaa tgtatttgtc atgttaaaaa gtgttagcca 3840 

caaaaggcta cttactttct tttccttttc ctttaaaatt gtaaatagat aatgttatgt 3900 

atcaggg 3907 



<210> 89 

<211> 1802 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 4753527CB1 



<400> 89 
caaggtcgag agagggaggc ggcggtgggg aggaggagga ggaggaggag caggcgcccg 60
ccatggccgc cgctatcacc cgacatggcc gacctggagg agctctcccg cctgagccct 120 
ctgccccccg gcagcccggg ttcggcggcc ggggccgggc tgagcccccc gaggaggagg 180 
aggaagagga ggaggaggaa gaggaggcgg aggccgaggc ggtggcggcg ctgctgctga 240 
acggcggcag cggtgggggc ggcggaggcg gcggcggagg agtggggggc ggcgaggcag 300 
agacgatgtc ggagccgagc cccgagagcg ccagccaggc cggggaggac gaagacgagg 360 
aggaggacga cgaggaggag gaagatgaga gcagcagcag cggcgggggt gaggaggaga 420 
gtagcgccga gagcctggtg ggcagcagcg gcgggagcag cagcgacgag acccgctcgt 480 
tgagccccgg cgccgccagc agcagcagcg gggatgggga cggcaaggag ggcctggagg 540 
agcccaaggg accgcggggc agccagggcg gcggcggggg cggcagcagt agcagcagcg 600 
tagtctccag cggcggcgac gagggctacg ggactggggg aggcggaagc agcgcgacct 660 
ccgggggccg gcggggcagc ttggagatgt cgtcggatgg ggaacccctg agccgcatgg 720 
actcggagga cagcataagc agtactataa tggatgtaga cagcacaatt tccagtgggc 780 
gttcaactcc agcaatgatg aatggacaag gaagcactac ttcttcaagc aaaaatattg 840 
cctataattg ttgttgggac cagtgccagg cttgcttcaa ctctagccca gatctggcag 900 
atcacatccg ttccatacat gtagatggtc agcgaggagg ggtatttgtt tgcttatgga 960 
aaggttgtaa agtatataac actccatcta ccagtcaaag ttggttacaa aggcatatgc 1020 
tgacacacag tggagacaaa cctttcaagt gtgttgttgg tggctgcaat gccagctttg 1080 
cttctcaggg agggctagct cgtcatgtac ccacacactt cagtcagcag aactcctcaa 1140 
aagtttctag ccagccaaag gccaaagaag aatctccttc taaagctgga atgaacaaaa 1200 
ggaggaaatt aaagaacaaa agacgacgct cattaccacg gccacatgat ttcttcgatg 1260 
cacaaacact ggatgcgata agacatcgag ccatatgctt taacctctca gctcatatag 1320 
aaagtttagg gaagggacac agtgttgttt ttcatagtac tgtaatagct aagagaaaag 1380 
aagattctgg gaagatcaaa cttttgcttc attggatgcc tgaagacatt ctgcctgatg 1440 
tgtgggtgaa tgaaagtgaa cgacatcagt taaaaactaa agtagttcat ttatcaaagc 1500 
tacccaaaga tactgccttg cttttggacc caaacatata cagaacaatg ccgcagaaga 1560 
ggttgaagag atttgacatc ctcaatttcc caaggtgatg agtgcttctt catctggaac 1620 
ctcacagcca tgagaacgga aaggggacga cttattttaa ccggttacag cgtaaacaca 1680 
catctccaca ggtgtctcct ggctgctatt gccctaaggt cttgggttac ctcacctgtt 1740 
ccatcgtgtg gtttcctagt aacccagaaa tttatctaag tgagactctc aggatagact 1800 
ta 1802 



<210> 90 

<211> 2469 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 6928688CB1 

<400> 90 

cggctcgagg cagtggagac aggtcatcgg tctgttgagc atttttgtca tctgcaaata 60
attttagcca ggtggcaagt gtatggactg tggagtaaga agtacctgcc tctgcctctg 120
acttgggttc tggcctatag gaacaagaca ttttcagtaa tggaagaagg agcttccagg 180
ctggggaaga tcatcatttc cagtggtagc ttgacctgag atgatacctg accaccactt 240
ctgggaaggc ccagaataga cagaagagag agaacatact agagttgcta gcccctaaga 300
gttgaacgaa atgggcaagg cccagctgaa ataacaggaa acacatgaaa aatgccacga 360
ttgtaatgtc tgtgaggaga gagcaaggca gttcttcagg ggaggggtca ttgtcattcg 420
aagatgtggc tgtgggcttc accagggagg agtggcagtt tttggaccag tctcagaagg 480
tcttgtacaa ggaagtaatg ttggagaact acatcaacct agtatcaata gggtatcgag 540
gcaccaagcc agattcgctc ttcaagttgg agcaaggaga acccccaggg atagcagaag 600
gagcagccca cagtcaaatc tgtccaggtt ttgtaatcca gagtagaaga tatgcaggaa 660
aagattctga tgcatttggt ggatatggga gatcatgcct ccatatcaag cgtgacaaaa 720
ctcttactgg agttaaatac catagatgtg ttaaacccag cagccctaaa tcacagctca 780
atgacctaca aaaaatctgt gcaggaggga aaccacatga atgcagtgtg tgcgggagag 840
ccttctccag gaaagcacag cttattcaac atcagagaac tgaaagagga gagaaacccc 900
atggatgtgg tgaatgtggg aaaacattca tgaggaagat tcagctcact gagcatcaga 960
gaactcacac aggagagaag ccccatgaat gtagtgaatg tggaaaagcc ttctccagaa 1020
agtcacagct catggtccat cagagaaccc atacaggaga gaaaccctac agatgcagcg 1080
aatgcggaaa agccttcagc cggaagtgcc ggctcaatag acatcagcga tcacatactg 1140
gagagaaact ctatgggtgc agtgtgtgtg ggaaagcctt ttctcagaag gcatacctca 1200
ctgcacacca gagacttcac acaggagata agccttataa atgcagtgat tgtggaagaa 1260
ccttctattt taagtcagac ctgaccagac atcagaggat tcatacggga gagaaacctt 1320
atgaatgtag tgagtgtgaa aaagccttta gaagcaagtc gaagctcatt cagcatcagc 1380
gtactcacac tggagagaga ccatattcat gcagagaatg tggcaaagcc tttgcccaca 1440
tgtcagtcct cattaaacat gagaaaactc acataagaga gacagccata aattcactga 1500
cggtggagaa accttcctca aggagtcata cctccttata catgagcgaa ctcatacaag 1560
agcaaaagac tgtgaacaca gtacctatag aaatgccttc ctcaggaacc ccgccattgt 1620
taaacaagag tgagcgccta gtgggcagaa atgtagtgat tgtggaacaa ccttttccaa 1680
gaaatcaagc ctttgtagtt aatcaggaat ttgaacagag aataagcctc acaaatgaag 1740
tgaatgtggc cccatcagta ataaattata tcttgtatct tacagatatt gtatcagaat 1800
aaatcttctg ataccagaaa ggtgggaaaa tctttagtag gaatcctcca gccattatat 1860
cccagagagc tcatttaggg ctgaaacact gtggaggcag tggttgtgga ggacatttct 1920
gtgagaagtc aaacctgtta aacaccattg aaatcatgtt gggtagaaat gcagctgtta 1980
tgaggaagtc tttacctgga gatgatatct caattggtat caaaagaatt cttacaggaa 2040
tatttatgaa agattgtgat ggtgctttta gcgctaaatt gtgcctcgtg tgccaaagaa 2100
acatagacaa ctgtagaggc aatggatatg gaaaacattt ttcgtaaaat atgggagaat 2160
ttatagaaaa gaaaaactct tgaaagtaag aaatgatggc aactcaccaa attaactcat 2220
gtatcatact tttatgtctt gagagtcttg gagggacgtg tttaatgaaa tcttgaaccc 2280
caggaatagt attatgtaat ggaaacctat gaatatacaa atatggagta aaatgtgtta 2340
gttcttatga gtgtgaccaa attacacaaa tttagggtta aaaacaaatc tttcttcagt 2400
cggggggggg ccacaggttc aggggaacaa aggggaaggg gggatggggg gggagggttg 2460
ggcacatag 2469



<210> 91 

<211> 3878 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7506388CB1 



<400> 91 

tccccccttc ccctccctcg ctgcccctgg cccagcggga gccccccccc cccccccagt 60 
gcgcctgtgc ggaggccctc ttgattatgt gtgccctctc cgggcgcccg cgttaccccc 120 
gggtggaggt ggggagggaa gacgctgagg aggaggagga ggcggaggag gcggtggagg 180 
ggaggtgggg ggaatcagca aggacatggc tcctgactcc tgtgcggaac gtgagtgact 240 
gagcggcaaa gcccgaatgg tctctagcga ccggcccgtg tcactggagg acgaggtctc 300 
ccatagtatg aaggagatga ttggaggctg ttgcgtttgc tcagacgaga gaggctgggc 360 
cgagaacccg ctggtttatt gcgacgggca cggctgcagc gtcgcggtgc atcaagcttg 420 
ctatggcatt gttcaagtac ccactggacc gtggttttgc aggaaatgtg aatctcagga 480 
gagagcagcc agagtgagat gtgaactttg tccccataag gatggagctt taaaaagaac 540 
agataatggg ggttgggccc atgtggtttg tgccctgtat attccagagg tacaatttgc 600 
caatgtttcc acaatggaac caattgtttt acagtctgtt ccgcatgatc gttataataa 660 
gacttgctac atttgtgatg aacaaggaag agaaagcaaa gcagccactg gtgcttgcat 720 
gacatgtaat aaacatggat gtcgacaggc tttccatgta acatgcgctc agtttgccgg 780 
actgctttgt gaagaagaag gtaatggtgc cgataatgtc caatactgtg gctactgtaa 840 
ataccatttt agtaagctga aaaagagcaa acggggatct aataggtcat atgatcaaag 900 
tttaagtgat tcttcctctc actctcagga taaacatcat gagaaagaga aaaaaaaata 960 
taaagagaag gacaaacaca aacagaaaca caagaagcag ccagaaccat cacctgcatt 1020 
ggttccatcc ttgactgtta ctacagaaaa aacttataca agcactagca acaactctat 1080
atctggatca ttgaagcgct tggaagatac tactgcacga tttacaaatg caaatttcca 1140 
ggaagtctct gcacacacct ctagtggaaa agatgtttca gagactagag ggtcagaggg 1200 
caaagggaag aaatcttcag ctcacagctc aggtcaaagg ggaagaaagc ctggtggtgg 1260 
aagaaatcca ggaacaactg tgtcagcagc tagccctttt cctcaaggca gtttttcagg 1320 
aactccaggc agtgtaaagt catcttctgg aagttcagtg cagtctcccc aggatttcct 1380 
gagctttaca gactcagatc tgcgtaatga cagttactct cactcccaac agtcatcagc 1440 
aaccaaagat gtacataaag gagagtctgg aagccaggaa gggggggtaa atagttttag 1500 
taccttaatt ggcctccctt caacctcagc tgttacttca cagcctaaaa gctttgaaaa 1560 
ttcacctgga gatttgggta attccagcct tcctacagca ggatataagc gggctcaaac 1620 
ttctggcata gaagaagaaa ctgtaaagga aaagaaaagg aaaggaaata aacaaagtaa 1680 
gcatgggcct ggcagaccca aaggaaacaa aaatcaagag aatgtttctc atctctcagt 1740 
ttcttctgct tcaccaacat catctgtagc atcagctgca ggaagcataa caagctctag 1800 
tctgcagaaa tctcctacat tgctcaggaa tggaagttta cagagcctca gtgttggctc 1860 
atctccagtt ggttcagaaa tttccatgca gtatcggcat gatggagctt gcccaacaac 1920 
tacgttctca gagttgctga atgcaataca caacggtatt tataacagca atgatgtagc 1980 
agtatcgttt ccaaatgtag tatctggctc gggatctagt actcctgtct ccagctctca 2040 
cttacctcag cagtcttctg ggcatttgca acaagtagga gcgctctctc cctcagctgt 2100 
gtcatctgca gcccctgctg ttgctacaac tcaggcaaat actctatctg gatcttctct 2160 
cagtcaggca ccatctcata tgtatggcaa tagatcaaat tcatcaatgg cagctcttat 2220 
agctcagtct gaaaacaatc aaacagatca agatcttgga gacaatagcc gcaacctagt 2280 
tggcagagga agctcacccc gaggaagtct ctcgccacga tcccctgtaa gcagcttaca 2340 
gattcgctat gatcaaccag gcaacagcag tttggaaaat ctgcctccag tagcagccag 2400 
catagaacag cttttggaga ggcagtggag tgaaggacag caatttttac tagaacaggg 2460 
tactcctagt gacattttag gaatgctgaa gtcattacac caacttcaag ttgaaaaccg 2520
aagattagag gaacaaatta aaaacttgac tgccaaaaag gaacggcttc agttattgaa 2580 
tgcacagctt tcagtgcctt ttccaacaat aacagcaaat cctagtccgt ctcatcaaat 2640 
acacacattt tcagcacaga ctgctcctac tactgattcc ttgaacagca gtaagagccc 2700 
tcatatagga aacagctttt tacctgataa ttctcttcct gtattaaatc aggacttaac 2760 
ctccagtgga caaagtacca gcagctcatc agctctttct accccacctc ctgctgggca 2820 
gagtccggct caacaaggct caggagtgag tggagttcag caggtcaatg gcgtgacagt 2880 
gggggcacta gctagtggaa tgcagcctgt aacttccacc attcctgccg tgtctgcagt 2940 
gggtggaata attggagctt tgccaggtaa ccaactggca attaatggca ttgtaggagc 3000 
tttaaatggg gttatgcaga ctcctgtcac aatgtcccag aaccctaccc ctctcaccca 3060 
cacaaccgta ccacctaatg caacacatcc aatgccagct acactgacta acagtgcctc 3120 
aggactagga ttactttctg accagcaacg acaaatactt attcatcaac agcagtttca 3180 
gcagttgtta aattctcaac agctcacacc agtacacagg cacccccact tcacacagct 3240 
accaccaacc catttctcac catccatgga gataatgcaa gtcagaaagt agcaagactt 3300 
agtgataaaa ctgggcctgt agctcaagag aaaagttgac acctgagaaa catctagaaa 3360 
ttgcctatcc tgctgttcta gcacttcatc tggctgcctt tgcagtcctt ttactacagc 3420 
tatgaagaaa cgcaacaaga aactcaatgc acaacaaagg attaattgct gcaaggacat 3480 
tcttgtaagg ctttgattag ttttcttgtt gctttgttgc actgaaatgg aattcccatg 3540 
cccctacccc ttaccccagt tttttgaaca tggaaagaaa atttaataac tttttaaagt 3600 
gacataattt acatgcaata tgtttatcaa ctcaagaatt taatatagtt ggtacacaac 3660 
tagttttgtt tataaattgg agatgcaaat agcaaaacta aatacttgct ccatttacaa 3720 
actacttgat tttattgtac aagttgaaat atgctctttt gtttgggtta cagtatgctt 3780 
gctctaagtc aaattccaag gaactaattt cttctcctgg agttgcattg attcagtatt 3840 
acaaatatat agcacatcac ctgggacttg gcaatctt 3878 

<210> 92 

<211> 5674 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 7376372CB1 
<400> 92 

gggagagcga gactccgagc gtgcgggcga ggaagctggg cggggggccg agacgggggc 60 
cgggacgacg gctggggcgc cggggtcgcc agctcctcgc ccacctccct ccctgtggga 120 
gaagaggagg aggggaagtg actgcggagt tgatgaactt tgcacatcca gaaacgccat 180 
tttgattcct tttccggaac aagtttgcat ctctcctcgg tctctccctg aagcccaccg 240 
gtccccaacg catcatgcca gggagccaag cgtcctgccc ggccgttgct ccgactgtgt 300 
gttccaggag tggtggctct gaggtgtgac cctgcccacg tttgggccca gccaggctca 360 
gccccccaat aaggagggca gcttgataac acaaagaaaa cagtgtcaac gagtactacc 420 
aagagaagaa gacaagaggt caacacagac tcattttcta ctccgtgtga atgatagcta 480 
cagcagggga aagtttcata gtctatcagt gggtcagaaa atggagtttt atagcagagg 540 
cttcttagaa gcttaaaccc ctgtcccaat gacgtcaagt tcgcccgctg gcttggaagg 600 
ttcagaccta tcttccatca acaccatgat gtcggcggtc atgagtgtag ggaaggtcac 660 
agagaatggc gggagccccc aggggatcaa gtccccctcg aagcctccag gaccaaatcg 720 
gattggcaga aggaaccagg aaacgaaaga ggagaagtct tcctataact gccccctgtg 780 
tgagaagatt tgcactaccc agcaccagct gaccatgcac attcgccagc acaacacaga 840 
cactggagga gccgaccact catgcagcat ctgcggaaag tcactgagct cggccagctc 900 
cctcgatcgc cacatgctgg tgcactctgg cgagaggcct tacaagtgca ctgtgtgtgg 960 
ccagtcattt accaccaatg ggaacatgca cagacatatg aagatccatg agaaggaccc 1020 
taacagtgcc acagccacag cccctccatc tcctctgaaa cgtaggcgat tgtcctccaa 1080 
gaggaaactg agtcacgatg ccgagtcaga gagagaagac ccagcaccag ctaaaaagat 1140 
ggtagaagac gggcagtcag gtgacttgga gaagaaagct gatgaagtct ttcactgccc 1200 
agtatgtttc aaggagtttg tttgcaagta tggactggag acccacatgg agacccattc 1260 
agataaccca ctaagatgtg acatttgttg tgtcaccttt cgaacacatc gaggactgct 1320 
gcgtcacaac gcgcttgtcc acaaacaact tcccagggat gcaatgggca gacctttcat 1380 
acagaacaac ccttcaattc ctgctggctt ccacgactta ggattcacgg acttctcctg 1440 
taggaagttt cctcgcattt ctcaggcctg gtgcgaaaca aacctgcgga ggtgcatcag 1500 
cgagcaacac cgttttgtct gcgacacctg tgacaaggcg ttccccatgc tctgctcagt 1560 
ggctctgcac aagcagaccc acgtggcggc agaccagggt caagaaaagc cgcaggccac 1620 
gcccctgcct ggtgacgccc tggaccagaa gggcttcctg gccttgcttg gcctgcagca 1680 
caccaaagac gtcaggcctg cccccgccga ggagcccctg ccggatgaca accaggcaat 1740 
tcagctccag acactcaagt gtcagctacc tcaggacccc ggctgcacca acctgctgag 1800 
cctgtcacct ttcgaagctg cttccctagg cggttctctc acagttctcc ccgcaaccaa 1860 
ggacagcata aagcacctgt ccctgcagcc cttccagaag ggcttcatca tccagcctga 1920 
cagcagcatt gtggtcaagc ccatctctgg cgagtcggcc atcgagctgg cagacatcca 1980 
gcaaattctg aagatggcag cctcggctcc ccctcagatc agtcttccgc ccttctccaa 2040 
ggcccctgcc gccccactgc aggcgatctt caagcacatg ccccctctga aggcaaagcc 2100 
cctggtcaca ccacggacgg tggtgggcca cctccacgcc cccgcctctc atcaacgcca 2160 
gcaggcttcc cggctgtatc agccccagcc tgccgccacc gcccctgaag ctcctcaaag 2220 
gctcactgga ggcggcctcc aacgcccacc tgctgcagtc caagtccggg acccagcccc 2280 
acgcggccac gcggctctcc ctgcagcgca gccgcgggcg gagctgccgg gccagcctga 2340 
gatgaagacg cagctggagc aggacagcat catcgaggcc ctgctgccgc tgagcatgga 2400 
ggccaagatc aagcaggaga tcacagaggg ggaactcaag gccttcatga cagcgcccgg 2460 
cggcaagaag acgcccgcca tgcgcaaggt gctctaccct tgccgcttct gcaaccaggt 2520 
gtttgccttc tcgggggtct tgcgtgccca cgtgcgctcc cacctgggca tctcgccata 2580 
ccagtgcaac atctgcgact acatcgccgc cgacaaggcc gcgctcatcc gccacctgcg 2640 
cacgcacagt ggggagcggc cctacatttg caagatctgc cactacccct tcactgtcaa 2700 
agccaactgc gagcggcacc tgcgcaagaa gcacctcaag gccacccgca aggatatcga 2760 
gaagaacatc gagtatgtga gtagcagcgc ggccgagctg gtggacgcct tctgcgcccc 2820 
ggacaccgtg tgccggctgt gcggcgagga cctcaagcac tatcgtgccc tgcgcatcca 2880 
catgccgacg cactgcggcc gcggcctggg cgggggccac aagggccgca agcccttcga 2940 
gtgcaaggag tgcagcgccg cgttcgcggc caagcgcaac tgcatccacc acatcctcaa 3000 
gcagcacctg cacgtgcccg agcaggacat cgagagctac gtgctggccg ccgacggcct 3060 
gggccccgca gaggcgccgg ccgctgaggc gtcggggcgc ggggaggaca gtggctgcgc 3120 
tgcccttggt gactgcaagc ccctcactgc cttcctggaa ccccagaacg gctttcttca 3180 
caggggcccc acccagcctc cacctcccca tgtctcgatc aagttggagc ccgccagtag 3240 
ctttgcggtg gacttcaatg agcccctgga cttctcgcag aagggcctgg ccctggtcca 3300 
agtgaagcag gaaaacatct cctttctgag cccttcttcc ctggtcccct atgactgctc 3360 
catggagccc atcgacctgt ccatccccaa gaacttcagg aaaggggaca aggatttggc 3420 
cactcccagc gaagccaaga agcctgagga ggaggcgggg agcagcgagc agccctctcc 3480 
ctgcccagca cccggccctt ctcttcctgt aactttgggg cccagcggaa tcctggaaag 3540 
ccccatggcc cctgctccgg cggccacccc ggaaccccca gcacagcccc tgcagggccc 3600 
tgttcagctg gcggtcccaa tctactcctc agccctggtc agcagccctc cactcgtggg 3660 
cagctcagcc ctcctgagtg gcacagcctt gctgcgtcca ctgcggccca agcccccgct 3720 
gcttttgcca aagccccccg tgacagaaga gctgcccccg ctggcctcca ttgcccagat 3780 
catctcatct gtatcctcgg cccccaccct gctgaaaacc aaggtggcgg acccagggcc 3840 
cgcaagcact ggcagtaaca ccacggcttc agacagctta ggaggttctg tccccaaagc 3900 
cgccaccacc gccacccccg ctgccaccac cagcccaaaa gagtctagtg agcctcccgc 3960 
tccagccaca ggcccagagg ctgcctctcc caccgagcag ggcccagcgg gcacgtcgaa 4020 
gaagaggggc cggaaaaggg ggatgaggag ccgaccccgc gccaacagcg gcggggtgga 4080 
cctgtactcc agcggggagt ttgccagcat cgagaagatg ctggccacta cagacaccaa 4140 
caagttcagt ccgtttctgc agacagcgga ggacaacact caggatgagg tggccggagc 4200 
ccctgccgac caccatgggc ccagtgatga agagcagggc agtcccccag aagacaagct 4260 
gctgagggcc aagcggaact cgtacaccaa ctgcctgcag aagatcacct gtccccactg 4320 
tccccgggtt ttcccttggg ccagctccct acagaggcac atgctcacac acactgacag 4380 
tcagtcggat gcgggagact gcagccgccg cgggcgaagt ggctatgacc tcacctcacg 4440 
ggacagagag cagccgtcgg agggcgccac tgagctccgc caggtcgcag gggatgcgcc 4500 
tgtggagcag gccacggcgg aaacggcctc gcggtgcacc gggaagagca cgggcgtggg 4560 
gagagccatg agccggagga ggagcatggc actgaggaga gcactgggga cgccgacgcg 4620 
gaagaggacg cgtcgagcaa ccagagcctg gacctggact tcgccaccaa gctcatggac 4680 
ttcaagctgg cggagggcga cgcggaggca ggccgggggc gcggcctcgc aggagcagaa 4740 
gctcgcctgc gacgcctgtg ggaagagctt caagttcctg ggcaccctga gccgccaccg 4800 
gaaggcgcac ggccgcccag gaggcccaag gacgagaagg gagatggcgc cagcactgca 4860 
gaggaggggc cccagcccgc ccctgaacag gaggagaagc cccccgagac cccggcagag 4920 
gtggtggagt cggcccgggt cggggaggcc ccggccggaa aagctcgcgg aggagacgga 4980 
gggcccctcc gacggggaga gcgcggccga gaaaaggtcc tcagagaaga gcgacgatga 5040 
caagaaacca aagacagact cccccaaaag cgtggccagc aaggcagaca agaggaagaa 5100 
ggtctgcagc gtgtgcaaca agcggttctg gtcgctgcag gacctgagcc ggcacatgcg 5160 
ctcccacaca ggggaaaggc catacaaatg tcagacctgc gagcgaacct tcaccttgaa 5220 
gcacagcctg gttcgccacc agcggatcca ccagaaagcc aggcatgcca aacaccacgg 5280 
gaaggacagc gacaaggaag agcggggtga ggaggacagc gagaatgagt ccacccacag 5340 
cggcaacaac gccgtctcag agaacgaggc tgagctggct cccaatgcca gcaaccacat 5400 
ggctgtcacc cggagccgga aggagggctt ggccagtgcc accaaggact gcagccacag 5460 
ggaggagaag gtcacgcagg gtggccgtct gagcctggcc agggtgacct taacccagag 5520 
agcccggcgg ccctggggca ggacctgctg gagccgcgca gcaagaggcc tgcccaccca 5580 
atcctggcac agctgatggc gcctcccagc tcgtcgtagg ggatggagtg acagcctcag 5640 
ccccctcagc acagacaaaa gccagcagag caaa 5674 

<210> 93 
<211> 3188 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2754344CB1 

<400> 93 

cccgccatct ctatcagccc cagggatttg gcaagggagg taggcggctc cctttcggcc 60 
actcattggt caaatcctaa tggaggtaag gcactctgcg agtctctatt ggtccctagg 120 
ctccaaaaag ggcggggaaa aacagaccga caggcctcta ggcgggaagc gcggagtgcg 180 
ccggcgcgta gtcggggacg ccgggccgag gctttgctag ggaacactgt tatcgccccg 240 
ccccctcgga gctttttctc cagttaactg tcggagtagc gggggctccg gcgccgggcg 300 
acatgccggt gcgcttcaag gggctgagtg aataccagag gaacttcctg tggaaaaagt 360 
cttatttgtc agagtcttgt aattcctccg tggggcgaaa gtacccatgg gctggactta 420 
gatcagatca attaggcatc acgaaagagc caagttttat ttcaaaaaga agagtccctt 480 
accatgaccc acagatttca aaatctctgg agtggaatgg agctatctca gagagcaatg 540 
tggttgcatc accagaacca gaagccccgg aaacaccaaa atcacaagaa gcagaacaaa 600 
aggatgttat tcaagaaaga gttcactcac tagaagcttc cagggttccc aaaagaacca 660 
gatctcactc tgcagactcc agagctgaag gggcttcaga tgtggaaaat aatgagggtg 720 
taacaaacca tacaccagtt aatgaaaatg tggaactgga acattctacc aaggttcttt 780 
cagaaaatgt agataatggg ttggatagac ttctgcgtaa gaaagctgga ttgactgttg 840 
ttccttcata taatgccttg agaaattctg aatatcaaag gcagtttgtt tggaagactt 900 
ctaaagaaac tgctccagct tttgcagcca atcaggtttt ccacaataaa agccagtttg 960 
ttccaccatt caaaggtaac tcagtcatcc atgaaactga atacaaaaga aatttcaagg 1020 
gtttatctcc agtgaaagaa ccaaaattaa gaaatgattt gagagaaaac agaaatcttg 1080 
aaacagtgtc tcctgaaagg aagagtaata aaatagacga tcgtttaaaa ttggaagcag 1140 
agatggaatt aaaagactta caccagccta aaaggaagct tactccttgg aaacatcaaa 1200 
ggcttgggaa ggtgaattcc gaatatagag caaaatttct gagcccagct cagtatttat 1260 
ataaagctgg ggcttggaca catgtaaagg gaaacatgcc aaatcagggt tctctaaatg 1320 
ccatgtggta tgctgaggtt aaagaactcc gagaaaaggc tgagttttat aggaagcgag 1380 
ttcaggggac gcatttttct cgggaccatc tgaatcagat tttatctgat agcaactgct 1440 
gttgggatgt ctcctcaacc acaagctcag aaggaaccat tagtagcaac atcagagcat 1500 
tagatcttgc tggagatcct acaagccata agactttgca gaaatgtcct tctacagaac 1560 
cagaagaaaa aggaaatatc gtggaagaac agccccagaa aaataccacg gagaaattgg 1620 
gtgtgtcagc tcccaccata cccgttagaa ggcggctggc ttgggataca gagaacacaa 1680 
gtgaagacgt acagaaacag cccggggaga aagaggagga ggacgacaat gaagaggaag 1740 
gggacaggaa aacgggcaag caggctttta tgggagagca agagaagttg gatgtacatg 1800 
agaaatctaa ggcagataag atgaaagaag ggtcagattc ttctgtatcc tcagaaaaag 1860 
gaggccggct tcctactccc aagctgagag aacttggtgg aatccagagg actcatcatg 1920 
atctcactac tccagctgtt ggtggtgctg ttttagtgtc tccatctaag atgaagcctc 1980 
cagccctaga acagaggaaa agaatgacct ctcaggattg tttagaaact tcaaagaatg 2040 
attttactaa gaaagaaagt cgtgctgtat ccctactgac ttctccagct gctggtataa 2100 
aaacagttga tcctctgcct ttgcgggaag attctgaaga caatatccac aaatttgctg 2160 
aggcaactct tccagtttca aaaattccaa aatacccaac aaatccccct ggacagttgc 2220 
cttctccacc acatgttcca tcctactggc atccctctcg acgaattcag ggctctctta 2280 
gagatccaga gtttcagcac aatgtgggaa aagcaaggat gaacaatttg cagttacctc 2340 
aacatgaagc ctttaatgat gaagatgagg acagattgtc tgagatttct gctcgctctg 2400 
cagcttctag tctccgggct tttcaaactc tggcacgagc taagaaaagg aaggagaatt 2460 
tctggggtaa aacataaacc tagctgagtt gcctttatct aaggaaacat tggcataatt 2520 
tttctttatt gcttactgtg ttaaaatttt ttgagtgaag tttttcctaa cattttctac 2580 
tttgaaatat ttaatttcca atagaacagt ttacttggaa attgttaaca taggcttata 2640 
caaatttaac ttttaaagaa ttctttttac atctatccca gagtacaaag acctagatta 2700 
ttgaattctt cacatgtaac tttcgtaata ggtgtcatgg atagattttc acctatccag 2760 
agagtattct aaaaaaataa acaaagtgaa gtaaaagtta agatcccagt gtgaagtata 2820 
tattccccta agatttgata tatgtatctt tataattgat gatgttgtaa aatgaatttg 2880 
ttgttccttt aacctttgct gtagtgtttc tttttatttt agttccttct atacatgttt 2940 
gcagtgttaa gataattgta ctttgaaaac accagctgta tatatttttc ttatttatgt 3000 
aacaaagttt gctgaaagat atgtatattt ttgatctttg agtgtattct atttgtgtaa 3060 
ggactttggc ttacatttga ataatgtcta gattttgaaa aattacctgt ataatggaga 3120 
attaaaactt tgtagacatt ttgaaataaa aatggatcaa gtgcgaaaaa aaaaaaaaaa 3180 
aaaaaaaa 3188 



<210> 94 

<211> 2209 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 8268822CB1 

<400> 94 

agcgggttta ctccggcgcg cagtgccggc cggggcgggg atacgccgtg tgcgcggccg 60 
ggggcggaac ggaggaggag gcggtggtgt cccggctgcg gggtaggagt ccgcggcagc 120 
ctccgggtaa gccaagcgcc gcgcagtgct gagttcccgc acgccgcaga gccatggaga 180 
tcggcaccga gatcagccgc aagatccgga gtgccattaa ggggaaatta caagaattag 240 
gagcttatgt tgatgaagaa cttcctgatt acattatggt gatggtggcc aacaagaaaa 300 
gtcaggacca aatgacagag gatctgtccc tgtttctagg gaacaacaca attcgattca 360 
ccgtatggct tcatggtgta ttagataaac ttcgctctgt tacaactgaa ccctctagtc 420 
tgaagtcttc tgataccaac atctttgata gtaacgtgcc ttcaaacaag agcaatttca 480 
gtcggggaga tgagaggagg catgaagctg cagtgccacc acttgccatt cctagcgcga 540 
gacctgaaaa aagagattcc agagtttcta caagttcgca ggagtcaaaa accacaaatg 600 
tcagacagac ttacgatgat ggagctgcaa cccgactaat gtcaacagtg aaacctttga 660 
gggagccagc accctctgaa gatgtgattg atattaagcc agaaccagat gatctcattg 720 
acgaagacct caactttgtg caggagaatc ccttatctca gaaaaaacct acagtgacac 780 
ttacatatgg ttcttctcgc ccttctattg aaatttatcg accacctgca agtagaaatg 840 
cagatagtgg tgttcattta aacaggttgc aatttcaaca gcagcagaat agtattcatg 900 
ctgccaagca gcttgatatg cagagtagtt gggtatatga aacaggacgt ttgtgtgaac 960 
cagaggtgct taacagctta gaagaaacgt atagtccgtt ctttagaaac aactcggaga 1020 
aaatgagtat ggaggatgaa aactttcgga agagaaagtt gcctgtggta agttcagttg 1080 
ttaaagtaaa aaaattcaat catgatggag aagaggagga agaagatgat gattacgggt 1140 
ctcgaacagg aagcatctcc agcagtgtgt ctgtgcctgc aaagcctgaa aggagacctt 1200 
ctcttccacc ttctaaacaa gctaacaaga atctgatttt gaaggctata tctgaagctc 1260 
aagaatccgt aacaaaaaca actaactact ctacagttcc acagaaacag acacttccag 1320 
ttgctcccag aactcgaact tctcaagaag aattgctagc agaagtggtc cagggacaaa 1380 
gtaggacccc cagaataagt ccccccatta aagaagagga aacaaaagga gattctgtag 1440 
aaaaaaatca aggaactcaa cagaggcaat tattatcccg actgcaaatc gacccagtaa 1500 
tggcagaaac tctgcagatg agtcaagctg agatgagtga actgagtgtg gcacagaaac 1560 
cagaaaaact tttggagcgc tgcaagtact ggcctgcttg taaaaatggg gatgagtgtg 1620 
cctaccatca ccccatctca ccctgcaaag ccttccccaa ttgtaaattt gctgaaaaat 1680 
gtttgtttgt tcacccaaat tgtaaatatg atgcaaagtg tactaaacca gattgtccct 1740 
tcactcatgt gagtagaaga attccagtac tgtctccaaa accagcagtt gcaccaccag 1800 
caccaccttc cagtagtcag ctctgccgtt acttccctgc ttgtaagaag atggaatgtc 1860 
ccttctatca tccaaaacat tgtaggttta acactcaatg tacaagaccg gactgcacat 1920 
tctaccatcc caccattaat gtcccaccac gacatgcctt gaaatggatt cgacctcaaa 1980 
ccagcgaata gcacccagtc ctgcctggca gaagatcatg cagtttggaa gttttcatgt 2040 
actgatgaaa gatactctac agaacttgtc aaatctttga aacttggaat atattgcttt 2100 
cataatatga agttttattg cctatctaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2160 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaag aaaaaagaag ccacccccc 2209 

<210> 95 
<211> 2783 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 1814553CB1 

<400> 95 

ggagcaggac gcgccggggc cgcctcctcc cgcacggacc catgaaccag ccgggcggcg 60 
cggcggctcc gcagggagcg cttgaagcgc agccagaaga gcaccaaggt ggagggccca 120 
gagccagtgc cagccgaggc ctcgctgagt gccgagcaag gaacgatgac ggaggtgaag 180 
gtgaagacag agctgcccga tgactacatc caggaggtga tctggcaggg cgaggccaag 240 
gaggagaaga aggcggtcag caaggatggg accagcgacg tgcctgccga gatctgcgtg 300 
gtgatcggcg gcgtccgcaa ccagcagacc cttgggtctt acgaatgcgg aatctgtggc 360 
aagaagtaca agtattacaa ctgcttccag acccacgtgc gggcgcaccg agacaccgaa 420 
gccacctcag gggagggagc ctcccaaagc aacaatttca ggtacacatg tgatatctgc 480 
gggaagaagt acaaatacta cagctgtttc caagagcacc gagacctgca cgcagtggat 540 
gtgtttagtg tggaaggggc ccctgagaac cgggcagacc ccttcgacca aggtgtcgtg 600 
gccacggacg aggtgaagga ggagcccccg gagccattcc agaaaatcgg gccaaaaact 660 
ggcaattaca cctgtgaatt ctgcggcaaa cagtacaagt actacactcc ctaccaggag 720 
catgtggcct tacacgcccc catcagtact gcccccgggt gggagccacc ggatgatcca 780 
gacacgggct ctgagtgttc acatccagag gtctccccat ctccacgctt cgtggcagcg 840 
aagacccaga cgaaccagtc ggggaaaaaa gctccggcct ccgtggtccg atgcgccacc 900 
ctcttacacc gcacccctcc agccacccaa acccagacgt tccgcactcc aaattcggga 960 
tctccggcga gcaaggcaac cgcagaaagc gctttcagtc ggagagtaga aggcaaagca 1020 
caaaaccact ttgaagagac gaacagcagt tcgcagaact ccagcgaacc ctacacctgc 1080 
ggcgcctgtg ggatccagtt ccagttctac aacaacctgt tggagcacat gcagtcccat 1140 
gcagctgaca atgaaaacaa cattgcctcc aaccagtccc gatcgccacc tgctgttgta 1200 
gaagagaagt ggaaacctca ggcccagagg aacagtgcca ataacaccac taccagtggt 1260 
ttaacaccca acagcatgat ccccgaaaag gagcggcaga acatcgcaga gcggctgttg 1320 
agggtcatgt gtgccgacct gggtgcactg agcgtggtca gcgggaagga gttcctgaag 1380 
ttggcccaga ccttagtaga cagtggtgcc cgctatgggg ccttctcggt cactgaaatc 1440 
ctgggcaact tcaacacgct ggcgctgaag cacctgccac gcatgtacaa ccaggtgaag 1500 
gtgaaagtga cctgtgcctt gggcagcaat gcctgcctag gcatcggtgt cacctgccac 1560 
tcccagagtg ttggccctga ctcctgctac atcctcacag cctaccaggc cgagggcaac 1620 
cacatcaaga gctatgtgct tggtgtgaag ggtgcggaca ttcgcgacag cggtgacctt 1680 
gtgcaccact gggtgcagaa cgtgctgtcg gagttcgtga tgtcggagat caggacagtg 1740 
tacgtgacgg attgccgggt gagcacgtcc gccttctcca aggccggcat gtgccttcgc 1800 
tgctcagcct gtgccttgaa ctcggtggtg cagagcgtgc tgagcaagcg gacactgcag 1860 
gcccgcagca tgcacgaggt catcgagctg ctcaacgtgt gcgaggacct ggcgggctcc 1920 
acgggcctgg ccaaggagac cttcgggtcg ctggaggaga cgtctccacc accctgctgg 1980 
aactcggtga cggactcact gttgctggtg catgagcgct atgagcagat ctgcgagttc 2040 
tacagccggg ccaagaagat gaacctcatc cagagcctca acaagcacct gctcagcaac 2100 
ctggcggcca tcctgacgcc ggtgaagcag gcagtcatcg agctgagcaa cgagagccag 2160 
cccaccctgc agctggtgct gcccacctac gtcaggctgg agaagctgtt cacggccaag 2220 
gccaacgacg caggcactgt cagcaagctc tgccacctct tcctggaggc gctcaaggag 2280 
aacttcaagg tgcacccggc ccacaaggtg gccatgatcc tggacccgca gcagaagctg 2340 
cggcctgtgc caccctacca gcacgaggag atcatcggca aggtctgtga gctcatcaac 2400 
gaggtgaagg agtcctgggc cgaggaggcc gacttcgagc ccgctgccaa gaagccccgc 2460 
tctgctgccg tcgagaaccc cgcagctcag gaagatgatc ggctaggcaa aaatgaagtg 2520 
tacgattacc tgcaggagcc cctcttccag gctacccctg atctcttcca gtactggtcg 2580 
tgcgttaccc aaaagcacac aaaactcgcc aagctcgcct tctggctcct ggcggttccg 2640 
gccgtgggcg ccagaagcgg gtgtgtaaat atgtgtgaac aagcgcttct aatcaaacgg 2700 
aggcggctgc tcagtccaga agatatgaat aaactcatgt ttctgaaatc caacatgctt 2760 
taagacttga cttcggggga aaa 2783 

<210> 96 

<211> 1610 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 71217830CB1 

<400> 96 

aattaattat atgtgacacc actgcactcc agccttggtg acagagtgaa actccatcta 60 
aaaacaaaac aaaatgaaac aaaaaccaaa taccctttaa atacatcctc gtgtactggt 120 

gtgtatgtat ctatggtatg aattcctaaa agttgaccta tttttttaag ttccatctaa 180 

ataaaatttc tcctttacaa catccactat tgtcatttct cctccccgtg gtttttgtgt 240 

ctttttagcc tcttgttcat ttactgcctt tcatgttccc atgctattgc cttggttttt 300 

taacaaaatt cttcctctgt gcctatcctg tcttacctac aaatagaatc ttcttgaggg 360 

caaggatggt ttcttgtacg gtgtattggc actcggtaaa taaataatga gagagggatt 420 

gagtgttggt atgagaaaag gagcctaatg gttgcatttg gatgaatggg tgctgagaga 480 

tggtatgtac atttatacat tcttgtctca tcttgtaaaa tgtttgaaaa taagagtaat 540 

tttcaacttg atttttctta ttttaggcaa gtatatcgcg tcaacacagc gacctgacgg 600 

gacctggcgc aagcagcgga gggtgaaaga aggatatgtg ccccaggagg aggtcccagt 660 

atatgaaaac aagtatgtga agtttttcaa gagtaaacca gagttgcccc cagggctaag 720 

ccctgaggcc actgctcctg tcaccccatc caggcctgaa ggtggtgaac caggcctctc 780 

caagacagcc aaacgtaacc tgaagcgaaa ggagaagagg cggcagcagc aagagaaagg 840 

agaggcagag gccttgagca ggactcttga taaggtgtcc ctggaagaga cagcccaact 900 

ccccagtgct ccacagggct ctcgggcagc ccccacagct gcatctgacc agcctgactc 960 

agctgccacc actgagaaag ccaagaagat aaagaaccta aagaagaaac tccggcaggt 1020 

ggaagagctg cagcagcgga tccaggctgg ggaagtcagc cagcccagca aagagcagct 1080 

agaaaagcta gcaaggagga gggcgctaga agaggagtta gaggacttgg agttaggcct 1140 

ctgaggcctt tggggaatag ggaatggact gcagaacaaa ccgtggggct ctctggggtc 1200 

tgggggaata cgggcaacag cagtcaggag gggtaccccc catactggct tacttccacc 1260 

tcctgcggcc cagctctgtc ctccagagcc tagcgtctcc ctcaatcctt cccttttctt 1320 

cccaacttct actttttgga ctttccccct cccattccca gtgttcaaaa tctcagtgac 1380 

taccccaggt acctttgctg ctgatttggg tgtcttgttt aaaagaaaat caggtgggtg 1440 

ggaatctctt ggagaactga ggctgagggt agagggagta tgcccaagtc ttggagtctt 1500 

ggttcctgtt cgcggtgttt atgggttatt tccctctcca tccctcattt tttttttttt 1560 

tttaaaaaaa gcaaaaatga gaataaacac aagtagactt gtaaacaaaa 1610 

<210> 97 

<211> 3467 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506252CB1 

<400> 97 

gcgggcgcgc gcgccatgga gcggctgcgg gacgtgcggg agcggctgca ggcgtgggag 60 
cgcgcgttcc gacggcagcg cgggcggcga ccgagccagg acgacgtgga ggcggcgccg 120 
gaggagaccc gcgcgctcta ccgggagtac cgcactctga agcgtaccac gggccaggcc 180 
ggcggcgggc tccgcagctc cgagtcgctc cccgcggcgg ccgaagaggc gccagagccc 240 
cgctgctggg ggccccatct gaatcgggct gcgaccaaga gtccacagcc tacgccaggg 300 
cggagccgcc agggctcggt gccggactac gggcagcggc tcaaggccaa tctgaaaggc 360 
accctgcagg ccggaccagc cctgggccgc agaccgtggc ctctaggaag agcctcatct 420 
aaggcatcca ccccaaagcc cccaggtaca gggcctgtcc cctcctttgc agaaaaagtc 480 
agtgatgagc ctccacagct ccctgagccc cagccaaggc caggccggct ccagcatctg 540 
caggcatccc tgagccagcg gctgggctcc ctagatcctg gctggttaca gcgatgtcac 600 
agtgaggtcc cagattttct gggggccccc aaagcctgca ggcctgatct aggctcagag 660 
gaatcacaac ttctgatccc tggtgagtcg gctgtccttg gtcctggtgc tggctcccag 720 
ggcccagagg cttcagcctt ccaagaagtc agcatccgtg tggggagccc ccagcccagc 780 
agcagtggag gcgagaagcg gagatggaac gaggagccct gggagagccc cgcacaggtc 840 
cagcaggaga gcagccaagc tggaccccca tcggaggggg ctggggctgt agcagttgag 900 
gaagaccctc caggggaacc tgtacaggca cagccacctc agccctgcag cagcccatcg 960 
aaccccaggt accacggact cagcccctcc agtcaagcta gggctgggaa ggctgagggc 1020 
acagcccccc tgcacatctt ccctcggctg gcccgccatg acaggggcaa ttacgtacgg 1080 
ctcaacatga agcagaaaca ctacgtgcgg ggccgggcac tccgtagcag gctcctccgc 1140 
aagcaggcat ggaagcagaa gtggcggaag aaaggggagt gttttggggg tggtggtgcc 1200 
acagtcacaa ccaaggagtc ttgtttcctg aacgagcagt tcgatcactg ggcagcccag 1260 
tgtccccggc cagcaagtga ggaagacaca gatgctgttg ggcctgagcc actggttcct 1320 
tcaccacaac ctgtacctga ggtgcccagc ctggacccca ccgtgctgcc actctactcc 1380 
ctggggccct cagggcagtt ggcagagacg ccggctgagg tgttccaggc cctggagcag 1440 
ctggggcacc aagcctttcg ccctgggcag gagcgtgcag tcatgcggat cctgtctggc 1500 
atctccacgc tgctggtgct gcctacaggt gccggcaagt ccctgtgcta ccagctccca 1560 
gcgctgctct acagccggcg cagcccctgc ctcacgttgg tcgtctctcc cctgctgtca 1620 
ctcatggatg accaggtgtc tggcctgcca ccgtgtctca aggcggcctg catacactcg 1680 
ggcatgacca ggaagcaacg ggaatctgtc ctgcagaaga ttcgggcagc ccaggtacac 1740 
gtgctgatgc tgacacctga ggcactggtg ggggcgggag gcctccctcc agccgcacag 1800 
ctgcctccag ttgcttttgc ctgcattgat gaggcccact gcctctccca gtggtcccac 1860 
aacttccggc cctgctacct gcgcgtctgc aaggtgcttc gggagcgcat gggcgtgcac 1920 
tgcttcctgg gcctcacagc cacagccaca cgccgcactg ccagtgacgt ggcacagcac 1980 
ctggctgtgg ctgaagagcc tgacctccac gggccagccc cagttcccac caacctgcac 2040 
ctttccgtgt ccatggacag ggacacagac caggcactgt tgacgctgct gcaaggcaaa 2100 
cgttttcaaa acctcgattc cattatcatt tactgcaacc ggcgcgagga cacagagcgg 2160 
atcgctgcgc tcctccgaac ctgcctgcac gcagcctggg tcccagggtc tggaggtcgt 2220 
gcccccaaaa ccacagccga ggcctaccac gcgggcatgt gcagccggga acggcggcgg 2280 
ccccagggcg aagacctgcg agagctgcgc agacatgtgc acgccgacag cacggacttc 2340 
ctggctgtga agaggctggt acagcgcgtg ttcccagcct gcacctgcac ctgcaccagg 2400 
ccgccctcgg agcaggaagg ggccgtgggt ggggagaggc ctgtgcccaa gtacccccct 2460 
caagaggctg agcagcttag ccaccaagca gccccaggac ccagaagggt ctgcatgggc 2520 
catgagcggg cactcccaat acagcttacc gtacaggctt tggacatgcc ggaggaggcc 2580 
atcgagactt tgctgtgcta cctggagctg cacccacacc actggctgga gctgctggcg 2640 
accacctata cccattgccg tctgaactgc cctgggggcc ctgcccagct ccaggccctg 2700 
gcccacaggt gtcccccttt ggctgtgtgc ttggcccagc agctgcctga ggacccaggg 2760 
caaggcagca gctccgtgga gtttgacatg gtcaagctgg tggactccat gggctgggag 2820 
ctggcctctg tgcggcgggc tctctgccag ctgcagtggg accacgagcc caggacaggt 2880 
gtgcggcgtg ggacaggggt gcttgtggag ttcagtgagc tggccttcca ccttcgcagc 2940 
ccgggggacc tgaccgctga ggagaaggac cagatatgtg acttcctcta tggccgtgtg 3000 
caggcccggg agcgccaggc cctggcccgt ctgcgcagaa ccttccaggc ctttcacagc 3060 
gtagccttcc ccagctgcgg gccctgcctg gagcagcagg atgaggagcg cagcaccagg 3120 
ctcaaggacc tgctcggccg ctactttgag gaagaggaag ggcaggagcc gggaggcatg 3180 
gaggacgcac agggccccga gccagggcag gccagactcc aggattggga ggaccaggtc 3240 
cgctgcgaca tccgccagtt cctgtccctg aggccagagg agaagttctc cagcagggct 3300 
gtggcccgca tcttccacgg catcggaagc ccctgctacc cggcccaggt gtacgggcag 3360 
gaccgacgct tctggagaaa atacctgcac ctgagcttcc atgccctggt gggcctggcc 3420 
acggaagagc tcctgcaggt ggcccgctga ctgcactgca ttggggg 3467 

<210> 98 
<211> 2392 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 2270608CB1 

<400> 98 

actaggctga acgggccgag cactcgagcg gggacgcgca gctctgcaga gccccggggc 60 
ggggcctcgg tccaggtgcg cggtagcgct tcgcggaggc aggaagtgtg tccctagcgg 120 
cggcccgtgc agcgctcccg cgagacgctc acctgcgccc caggtgagcg gcgagggggc 180 
gggggagggg ctgagccgga ggcggctcac ctggcgggac aggtgcctgg ctgctacaaa 240 
ccatgcaatg agccatgccc cgccctggac acccccgccc agcatctggg cctccacgct 300 
tgggaccgtg ggagcggcca acagagctat gtctggagac atatgataaa ccacctcagc 360 



WO 03/054219 PCT/US02/41115 



ccccaccaag ccgccgcacc cgtagaccag accccaagga ccctggccac catgggccag 420 
agagcattac cttcatctct ggctctgctg agccggccct tgagtccccc acctgctgcc 480 
tgctctggcg accctgggtg tgggagtggt gccgggctgc cttctgcttc cgccgctgcc 540 
gggattgcct ccagcgctgt ggagcctgtg tgcggggatg cagcccctgc ctgtctactg 600 
aggactccac tgaggggact gctgaagcca actgggccaa ggagcacaat ggagtgcccc 660 
ccagccctga tcgtgcaccc cccagccggc gggatggcca gcggctcaag tcaaccatgg 720 
gcagcagctt cagctacccc gatgttaagc tcaaaggcat ccctgtgtat ccctacccga 780 
gggccacctc cccagcccct gatgcggact cctgctgcaa ggagccactg gccgatcccc 840 
cacccatgcg acacagcctg cccagcacct ttgccagtag tcctcgtggc tccgaggagt 900 
actattcttt ccatgagtcg gacctggacc tgccggagat gggcagtggc tccatgtcga 960 
gccgagaaat tgatgtgctc atcttcaaga agctgacaga gctgttcagc gtacaccaga 1020 
tcgatgagct ggccaagtgc acatcagaca ctgtgttcct ggagaagacc agtaagatct 1080 
cggaccttat cagcagcatc acacaggact accacctgga tgagcaggat gctgagggcc 1140 
gcctggtacg cggcatcatt cgcattagta cccgaaagag ccgtgctcgc ccacagacct 1200 
cggagggtcg ttcaactcgg gctgctgccc caaccgctgc tgcccctgac agtggccatg 1260 
agaccatggt gggctcaggt ctcagccagg atgagctgac agtgcagatc tcccaggaga 1320 
cgactgcaga tgccatcgcc cggaagctga ggccttatgg agctccaggg tacccagcaa 1380 
gccatgactc atccttccag ggcaccgaca cagactcgtc gggggcaccc ttgctccagg 1440 
tgtactgcta acccctgcca ggcccagctg ccacaccctt tctgggagaa gcatggccta 1500 
cagaatgaag agggggacca ggaacccctg tgggagaggc ttagacctga agcagtgccc 1560 
actctggctc ctcctgcctt ggctgactgg gttcctggac catgtgcatt tcactgggcc 1620 
atgggatcta catctccttg catccccagc tggtctgatc cctgccaggg ccccttcctt 1680 
cctgctcatg gtcttcaggt ggcctgatca tggaaagtaa ggagttaggc attaccttct 1740 
gggagtgaac cctgactcca tccccctatt gccaccctaa ccaatcatgc aaacttctcc 1800 
ctccctgggg taattcaaca gttaaaagaa gcttatctta aatgtattgt attggggggt 1860 
gggcagggcc cactctatgt tatgttaagg agttggttct ggttcttggc tgatgttctg 1920 
tatcttaaca tgaccacagt ttgtaagtac aaaggtaatt ggtatctgcc tgctttcttg 1980 
gctgtccttt ctctcccatc tgcctcactt aactgctgcc cagagcagga gacaggatat 2040 
gtgtccagcc tggcttagtg cccgcagtta aagtaggaag gataaaatgg gagtaatgct 2100 
gcccccattt caggttcact gtgagttcca ggagataaag aaccaagcac agagtctgtg 2160 
cctgagaaag gttggataaa tggtttctga cattgtttga gaggttcttc tagggaggcc 2220 
tgtccgaact gcctctgctc tcccagtcca ctgcctgcag gctggcctgc ttctgcccag 2280 
cccacagttt ccccaaagcc ttgatcattt gttggcccta cccagccttg gacctgctgc 2340 
tccccttcca ctctctctgt ggtagtaaca gagcactgtg gttttttttt tt 2392 

<210> 99 
<211> 1326 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7502428CB1 

<400> 99 

tgtgtcccct taagcatgtc tgttatgagc ttccttgact tcacacctct cagatttgac 60 
aaacatgagg agaaacaaag caaactctct tttctgtggg ttaatttgtc tctcattttg 120 
ggtccttcgt atatcagagt gtgaatgata ctttattgtc ctcacttgtg cttcaattcg 180 
tggctgtttt gcaggaggag ctcctaaaaa aaatgcagtc tttatgggaa aaagcttgtg 240 
aaaatctcag aaatctgaac atggaaacca caagaaccag atgctggaag gattatgtga 300 
gtttaaggat agaagcaatc agagctgaat atcagaagat gcctgcattt ctccatgaag 360 
aagagcaaca tcacttggaa aggctgcgaa aggagggcga ggacattttt cagcaactca 420 
atgaaagcaa agccagaatg gaacattcca gggagctttt aagaggaatg tatgaggatc 480 
tgaagcaaat gtgccataaa gcagatgtgg agctactcca ggcttttgga gacatattac 540 
acaggtatga gtctctgctg ctgcaagtgt ctgagcctgt gaatccagag ctcagtgcag 600 
ggcccatcac tggactgctg gacagcctca gtggattcag agttgatttt actctgcagc 660 
ctgaaagagc caatagtcat atcttcctgt gtggagattt gagaagcatg aatgttggat 720 
gtgaccctca agatgatccc gatatcactg gaaaatctga atgttttctt gtatgggggg 780 
ctcaggcttt cacatctggc aaatattatt gggaggttca tatgggggac tcttggaatt 840 
gggcttttgg tgtctgtaac aattattgga aagagaagag acagaatgac aagatagatg 900 
gagaggaggg actctttctt cttggatgtg ttaaggagga cactcactgc agtctcttta 960 
ccaccacccc acttgtggtg caatatgttc caagacctac cagcacagta ggattattcc 1020 
tggattgtga aggtagaatc atgagctttg ttgatgttga tcaaagtttc ctgatataca 1080 
tcatccctaa ttgctccttc tcacctcctc tcaggcctat cttttgctgt agtcacttct 1140 
gaccagagaa aagtcagaaa tgtgcctgta tgctctggga acctgtttat cccagaaagc 1200 
cctctttttc gcacctcatc aaacagaaca aataagttat atttaatgtc tttagttgca 1260 
ttctaatgtc atcaaaactc atttatagtg tttctattaa atatggtgaa aacattaaaa 1320 
aaaaaa 1326 



<210> 100 
<211> 2245 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 368741CB1 



<400> 100 

ggaccattct ggtcaggtgt ggggggttgt tgtgtacact ggtggccatt ttttggcttt 60 
cagacaatgg tgagaatatc tagggagctg ctgatgcaca ttgaggtctc ttgctttagg 120 
aactgatgac gcttgataat gtggctgggg acttcaggga agagtgggtt acctggacac 180 
tgcttagaga atcctctcca ggaatgccac ccagcacagt tagaagaatg ggctctcaaa 240 
ggaatttcca ggcctagtgt aatctcccag ccggagcaga aagaagagcc atgggtccta 300 
ccactccaaa actttgaggc gaggaagatc ccgagggaaa gccacacaga ctgtgagcat 360 
caggtggcaa agctcaatca ggacaattct gaaacagcag aacaatgtgg aacatcctca 420 
gaaaggacca ataaagatct ttctcatact cttagttggg gaggaaactg ggagcaaggc 480 
ctagaattag aagggcaata tggaaccctt ccaggagagg gccagctgga gtccttttca 540 
caggagaggg atttaaacaa gctcctggat ggatatgtag gagagaagcc tatgtgtgca 600 
gaatgcggga aaagctttaa ccagagttcc tatctcataa gacacctaag aacccacact 660 
ggcgagaggc cctatacgtg cattgagtgt gggaaaggct tcaaacagag ctcagacctt 720 
gtcacccatc gcagaacaca cacaggagag aagccctacc aatgcaaggg gtgtgagaag 780 
aaattcagcg acagctcaac actcatcaaa catcagagaa cccacacagg ggagagaccc 840 
tatgagtgcc cagagtgtgg aaagactttt gggcggaagc cacacctcat aatgcaccaa 900 
agaacccaca caggcgagaa gccctacgcg tgcctggaat gtcacaaaag cttcagtcga 960 
agctcaaatt tcatcactca ccagaggacc cacacagggg tgaagcctta caggtgtaat 1020 
gactgtgggg agagttttag ccagagctcg gatttgatta agcaccaacg aacccacacg 1080 
ggagaacggc ccttcaaatg cccggagtgc gggaagggct tcagagatag ttctcatttt 1140 
gtagctcaca tgagcactca ttcaggagag aggcctttca gttgtcctga ctgccacaaa 1200 
agcttcagtc agagctcaca tttggtcacg caccaaagaa cacacacagg tgagagacct 1260 
tttaagtgcg aaaactgtgg gaaaggattc gccgacagct ccgccctcat taagcaccaa 1320 
cgaatccaca ccggagaaag accctacaaa tgtggagagt gtgggaagag cttcaatcag 1380 
agctcccact ttattaccca tcagcgaatc cacttaggag acaggcccta tcgatgtcct 1440 
gagtgtggca agaccttcaa tcagcgttcc catttcctca cacaccagag aacgcataca 1500 
ggagaaaaac ctttccactg tagtaaatgt aacaagagct tccgtcagaa agcgcatctt 1560 
ttatgccatc aaaacaccca tttgatttag gaagtagtct ttggtgttca gctgctccct 1620 
tgcacatttt cattgctact gtcttcaagc accccaaata gagaaaacct gggcgtcagt 1680 
ggctcaattt gggccctgat ctattctccc tctttcttgt ctatgttata acagagagga 1740 
taaacttaaa gggtccaaat aacggtccga atacaaaagg cattccttca gtgtgtgact 1800 
gactcttagg gaaatgtgag tttaatagtt gatgcccgcc aggcgtggtg gctcacccct 1860 
gtaatcccag cacttttggg aggccaaggt gggtggatca cttgaggtca ggagttgaga 1920 
ccagcctggt gagcatggtg aaacctcatc tctactaaaa atgcaaaaat tagctgggca 1980 
tggtggcacg cgcctgtagt cccagctact cgggaggccg aggcagaaga atcatttgaa 2040 
ctcagaaggt gcaggttgca gtgagttgag atcatgccac tgcactccag cctgggcaac 2100 
agagagagac tctgtctcca agaaaattaa aaagttgatg cctagttact aaatgagaag 2160 
tgagaaaaat gtggcctaga gatcacatgt tcaccaccta gtacagtgcc tggcacaaca 2220 
tagatggtca ataactaaag tttgg 2245 

<210> 101 
<211> 547 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506379CB1 



<400> 101 

gtggcggcga acgcggagag cacgccatga aggcctcggg cacgctacga gagtacaagt 60 
taaagaagat gaagaagtct tcaggggaga ttgtctactg tgggcaggtg tttgagaagt 120 
cccccctgcg ggtgaagaac ttcgggatct ggctgcgcta tgactcccgg agcggcaccc 180 
acaacatgta ccgggaatac cgggacctga ccaccgcagg cgctgtcacc cagtgctacc 240 
gagacatggg tgcccggcac cgcgcccgag cccactccat tcagatcatg aaggtggagg 300 
agatcgcggc cagcaagtgc cgccggccgg ctgtcaagca gttccacgac tccaagatca 360 
agttcccgct gccccaccgg gtcctgcgcc gtcagcacaa gccacgcttc accaccaaga 420 
ggcccaacac cttcttctag gtgcagggcc ctcgtccggg tgcagggccc tcgtccgggt 480 
gtgccccaaa taaactcagg aacgccccgg tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa 540 
aattggt 547 

<210> 102 
<211> 880 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506253CB1 



<400> 102 

tccgggccca aggctgaggg acccggtgga gcggaagtca ctccctgagg cagtggcgac 60
agcggcggcg agaggatgaa caacaagttc gacgctttga aagatgatga cagtggggac 120
catgatcaga atgaagaaaa cagcacacag aaagatggtg agaaggaaaa aacggaacga 180
gacaagaatc agagcagtag caagagaaag gtggagcagt tctggaggtt ttatagccac 240
atggtacgtc ctggggacct gacaggccac agtgacttcc atctcttcaa agaaggaatt 300
aaacccatgt gggaggatga tgcaaataaa aatggtggca agtggattat tcggctgcgg 360
aagggcttgg cctcccgttg ctgggagaat ctcattttgg ccatgctggg ggaacagttc 420
atggttgggg aggagatctg tggggctgtg gtgtctgtcc gctttcagga agacattatt 480
tcaatatgga ataagactgc cagtgaccaa gcaaccacag cccgaatccg ggacacactt 540
cggcgagtgc ttaacctacc tcccaacacc attatggaat acaaaactca caccgacagc 600
atcaaaatgc caggcaggct gggcccccaa aggctccttt ttcaaaacct ctggaagccg 660
cggttgaatg tgccatgacc ctctccctct ctggatggca ccatcattga agctggcgtc 720
atcggagtct cttgttctgt tggcgtgcta cctggaagat ccttctgtcc tggacaagag 780
gaattggaag agcattttat gttttaagaa caggctgaca cgcagcagct acaacaacag 840
ctgagatcac ttaataaatg gtgctaaact aaaaaaaaaa 880



<210> 103 
<211> 813 
<212> DNA 
<213> Homo sapiens



<220> 

<221> misc_feature

<223> Incyte ID No: 7506353CB1 



<400> 103 

cccggaaacc ccaaggccca gacggctctc agatccgggg gctgcggata aatccccgta 60 

ggccgcgggc agcgagatgt tgcgttccgg tgtgggtgtg ggtgtgcctc cgacggcgtc 120 

tcggtgccag tgtcgaggtt ctttctgctt agctacccgg agccgactac ggaggaggac 180 

acctgagttt acgtctcttc catctgctgc tcgcctcagc tgcctgggtc cccgacgaga 240 

gccaggtgac acttaactcc gccatctgcg ttttgagcac tgttctcata atggagtttc 300 

ctgatttggg gaagcattgt tcagaaaaga cttgcaagca gctagatttt cttccagtaa 360 

aatgtgatgc atgtaaacaa gatttctgta aagatcattt tccatacgct gcacataagt 420 

gtccgtttgc attccagaag gatgttcacg tcccagtatg cccactctgt aataccccca 480 

tcccagtaaa aaagggccag ataccagacg tggtggttgg tgatcacatt gacagagact 540 

gtgactctca ccctgggaag aagaaagaga agcacagaca ccctttggac cacagctgca 600 

gacacgggag tcgccccacc atcaaagctg ggtgagaaga gactccgctg cgatggctcg 660 

gagcacgcag tagcatgcgt ggaagcagct tacactctag tgggaagtgg agccccattg 720 

agcaccatcc cacactggct gctgatcttg tttgttgagg gagatgggac tataataaaa 780 

ttgaatgctg aagttgaaaa aaaaaaaaaa agg 813 

<210> 104 

<211> 3014 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506372CB1 



<400> 104 

cgctatggcg gcggtgttga ggttgggtac gggatgcggg gtctttgact gaaggggtag 60
gccaagtgga ggtatcaggg acgtcgcgcg gcacagaaga ggaccagcct ggacgccggg 120
gacgctgtca tgtacggcgc gagcgggggc cgcgccaaac ccgagaggaa aagcggcgcg 180
aaggaggagg ccgggccagg cggtgccggc ggtgggggca gccgagtgga gctcttggtt 240
ttcggctatg cctgcaagct gttccgggac gacgagcggg ccctggctca ggaacaggga 300
cagcacctca tcccctggat gggggaccac aagatcctca tcgacagata tgatggacgt 360
ggtcacctgc atgacctttc tgagtacgat gctgagtatt ccacgtggaa cagagattat 420
cagctgtctg aagaggaggc gcgaatagag gccctgtgtg atgaagagag gtatttagcc 480
ttgcatacgg acttgcttga ggaggaggca aggcaagagg aagaatacaa gcgattgagt 540
gaagcactag cagaggatgg gagctacaat gccgtggggt tcacttacgg tagcgactat 600
tacgacccgt cagagccgac ggaggaggag gagccttcca aacagagaga aaaaaatgag 660
gccgaaaatt tagaggaaaa tgaagagccc ttcgttgccc ccttaggatt gagcgtcccg 720
tctgacgtgg agttgccacc aaccgctaaa atgcacgcca tcatcgagcg cacggccagc 780
ttcgtgtgca ggcagggagc acagtttgag atcatgctga aggccaagca ggcccggaac 840
tcccagtttg actttctgcg cttcgaccac tacctcaacc cctactataa gttcatccag 900
aaagccatga aagagggacg ctacactgtc ctggcagaaa acaaaagtga cgagaaaaaa 960
aaatcaggag tcagctctga caatgaagat gatgatgatg aagaagatgg gaattacctt 1020
catccctctc tctttgcctc caagaagtgt aaccgccttg aagagctgat gaagcccttg 1080
aaggtagtgg acccagatca tcccctcgca gcacttgttc gtaaggcaca ggctgacagt 1140
tccactccca ccccacacaa cgcagacggt gcgcctgtgc agccctccca ggtggagtac 1200
acggcagact cgaccgtggc agccatgtat tacagctact acatgctacc ggacggcact 1260
tactgcctgg cgccgccccc tcccggaatc gacgtgacta cttactacag cacccttcct 1320
gctggcgtga ccgtgtctaa ctcccctgga gtgacgacca ccgccccacc acctcctggg 1380
accacaccac caccgccccc aaccacagca gagactagca gcggggccac ctccacaacc 1440
accaccacaa gtgcacttgc ccccgtggcc gccatcatcc ccccgccccc cgacgtccag 1500
cccgtgattg acaagctggc cgagtatgtc gccaggaacg gcctgaagtt cgagaccagt 1560
gttcgtgcca agaatgatca aagatttgag ttcctgcagc cgtggcacca gtataatgct 1620
tattatgagt ttaagaagca gttcttcctc cagaaagaag ggggcgatag catgcaggct 1680
gtgtctgcac cagaagaggc tcccacagac tctgctcccg agaagccaag tgatgctggg 1740
gaggatggcg cgcctgaaga cgcagccgag gtgggagcac gggcaggctc aggcgggaag 1800
aaggaggcat cgtccagtaa gaccgtcccg gacgggaagc tggtgaaagc aaagcaaaag 1860
ctggaagatc gcctcgcagc tgctgcccgg gaaaagctgg cccaggcgtc taaggagtca 1920
aaagagaaac agcttcaagc agaacgtaaa aggaaagcgg cgttattttt acagaccctc 1980
aaaaatcctc tgccggaagc agaagctggg aaaattgagg agagtccttt cagtgtcgag 2040
gaatccagca ctacgccctg ccctctactg actggaggca ggcctctgcc tactttagaa 2100
gttaaaccac ccgataggcc ttcgagcaaa agcaaagatc caccgagaga agaagagaaa 2160
gaaaagaaaa agaaaaagca caaaaaaaga tctcgaacaa gatcacgttc tcccaagtac 2220
cattcgtcat ccaagtccag gtctagatca cactcaaaag caaagcattc tcttcccagt 2280
gcctatcgga cagtgcggcg gtcgaggtcc cgctcccggt cccctcggag gagagcccac 2340
tcccctgaga gacggaggga agagaggagt gtgcccactg cctaccgcgt gagccgcagc 2400
cctggggcca gcaggaagcg gacccgctcc agaagtcccc acgagaagaa gaagaagagg 2460
cggtcccggt cgcggaccaa gtccaaggcc aggtctcagt cggtgtcacc cagcaagcag 2520
gcagcgcccc ggcccgcggc ccccgcggcc cactcggcgc actcagccag cgtctcccct 2580
gtggagagtc ggggctccag ccaggagcgc tccaggggag tctctcagga aaaagaagcc 2640
cagatctctt cagcaatcgt ttcttccgtg cagagcaaaa tcactcagga tctcatggcc 2700
aaagtcagag cgatgcttgc agcttccaaa aacctgcaaa ccagcgcttc ctgagacggg 2760
gccagcggag gcagagccgg gaggctgcgt gggcttctgg gcaggctcac gcagacgccg 2820
gccacaccat ccacctggcc gcctccatgg acccttggtg gcttttgtaa attaattttt 2880
gatgacattt tgagttttaa gatttctgac cagcagtctc ttacctgtat atttgtaaat 2940
atatcatgtt tctgtgaaaa tgtattatga aataaaatgg gaggaaacac cttttctagc 3000
tagaaaaaaa aaaa 3014



<210> 105 
<211> 772 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 7506335CB1 



<400> 105 

gccgggaccc gccctagggg ctcggatcgc cgcgcgctcg ccgctcgccc gccagcccgc 60
ccgtggtccg tggcggcgcg ctccacccgg cacggggagg cgcggggcgc accatggccg 120
cagacacgcc ggggaaaccg agcgcctcgc cgatggcagg agcgccggcc agcgccagcc 180
ggaccccaga caagccccgg agcgcggccg agcaccgcaa gagctcccgc cactcgaagc 240
tggagaaggc ggacatcctg gagatgaccg tgagacacct gcggagcctg cgtcgcgtgc 300
aggtgacggc cgcgctcagc gccgaccccg ccgttctggg caagtaccgc gccggcttcc 360
acgagtgtct ggcggaggtg aaccgcttcc tggccggctg cgagggcgtc ccggccgacg 420
tgcgctcccg cctgctgggc cacctggcag cctgcctgcg ccagctggga ccctcccgcc 480
gcccggcctc gctgtccccg gctgcccccg cagaggcccc agcgcccgag gtctacgcgg 540
gccgcccgct gctgccatcg ctcggcggcc ccttccctct gctcgcgccg ccgctgctgc 600
cgggtctgac ccgggcgctg cccgccgccc ccagggcggg gccgcagggc ccgggtgggc 660
cctggaggcc gtggctgcgc tgaggctgtg gcctgagact gcactcggat taccggcgcc 720
ccgttctagg gccgtgccct taaccgagag gtagcagaga tactattctc tc 772



<210> 106 

<211> 4206 

<212> DNA 

<213> Homo sapiens 
<220>

<221> misc_feature 

<223> Incyte ID No: 5546982CB1 

<400> 106 

cgcccaggtc tctgcgcggc gccgttccca gccgcagtcc gcaggagccg ctgctgcagc 60 
cggcagctgg gcgggattcc cggggtagtc gctgggagca aagctcgctt ctcccgcgcc 120 
tgtgcttttg cctgccccgg cttacttctc cccgcgtcgg tccgctggcc acgtacagct 180 
gaccagctca tcttccagct tcggaccgag gctccacggc cgccgcaggg acccgcccac 240 
cagaccctgc gctcggtagc cggtgtgagt ggaacgtgcc gctgatgaca cctggcagaa 300 
ggcagccatg gagggcgatg gctcagattc gccggttacc attaagaata tcgaaaggga 360 
gctcatttgc ccagcatgca aggagctgtt tacccaccca ttgattctcc cttgccaaca 420 
tagtatctgt cataaatgtg taaaagaact cctgctgact ctcgatgatt cattcaacga 480 
tgtgggatca gacaactcca atcaaagcag tcctcgactt cggctcccct cccctagtat 540 
ggataaaatt gaccgaatta acagaccagg ctggaagcgc aattcattga ccccgaggac 600 
aactgttttc ccttgccctg gctgtgagca tgatgtggat cttggagaac gaggaatcaa 660 
tggtctgttt cgaaacttca ctttggaaac tattgtggaa agatatcgtc aagcagctag 720 
ggcagccaca gccattatgt gtgacctttg taaaccacca cctcaagaat ccacaaaaag 780 
ctgcatggac tgtagtgcaa gttactgcaa tgaatgcttc aaaattcatc acccttgggg 840 
tactataaaa gctcaacatg agtatgttgg tccaactact aacttcagac ccaagatttt 900 
aatgtgccca gaacatgaaa cagagagaat aaacatgtac tgtgaattat gtaggaggcc 960 
agtttgccat ctgtgtaagt tgggtggtaa tcatgccaac caccgtgtaa ccactatgag 1020 
cagtgcctac aaaaccttaa aggaaaagct ttcaaaggat attgattacc ttattggtaa 1080 
ggaaagccag gtgaagagtc aaatatctga actaaacttg ttaatgaaag aaacagagtg 1140 
taatggagag agggctaaag aagaagcaat tacacatttt gaaaagctct ttgaagttct 1200 
ggaagagagg aaatcatctg ttttgaaagc aattgactcc tctaagaaac taagattaga 1260 
caaatttcag actcaaatgg aagagtacca gggacttcta gagaacaatg gacttgtggg 1320 
atatgctcaa gaagtgctaa aggagacaga tcagtcttgc tttgtgcaga cagcaaagca 1380 
gctccacctc agaatacaga aagccacaga atctttgagg agctttagac ctgcagctca 1440 
gacttctttt gaagactatg ttgttaatac ctctaaacaa acagaacttc ttggagaatt 1500 
atcctttttc tctagtggca tagacgtgcc agagatcaat gaggaacaga gcaaagttta 1560 
taacaatgcc ttgataaatt ggcaccatcc agaaaaggat aaagctgata gctatgttct 1620 
tgaatatcgg aaaatcaata gagatgatga aatgtcatgg aatgagatag aagtgtgtgg 1680 
aacaagtaaa ataattcaag acttggaaaa cagtagtacc tatgctttca gagtaagagc 1740 
ttacaagggt tcaatctgta gtccttgcag cagagaattg attcttcata ctcctccagc 1800 
tccagttttc agcttcctct ttgatgaaaa atgtggctat aataatgaac acctcctgct 1860
gaacttgaag agagaccgtg tagagagtag agctggattt aatcttctgc ttgctgcaga 1920 
acgcatccaa gtgggttatt acacaagctt agactacatc attggagata ctggcattac 1980 
aaaaggaaaa cacttctggg ccttccgtgt ggaaccatat tcatacctgg taaaagtggg 2040 
agttgcttct agcgataaac tacaagaatg gctccgttct ccccgggatg cagttagtcc 2100 
aagatatgag caagacagtg ggcatgacag tggaagtgag gatgcctgtt ttgattcttc 2160 
acaaccattt accttagtta ctataggcat gcagaaattt tttataccca agtcacctac 2220 
ttcttctaat gaacctgaaa atagagttct ccctatgcca acaagtattg ggattttcct 2280 
tgactgtgat aaaggcaaag taaatttcta tgatatggat cagatgaaat gcctttatga 2340 
acgccaagtg gactgttcac atacactgta tccagcattt gcattaatgg gcagtggagg 2400 
aattcagctt gaagaaccca tcacagcaaa atatctggaa taccaagagg acatgtagtt 2460 
gaaacatctg atgtgactca ggataaaaac tgtaaacaaa gcaatgcctt agcgttaacc 2520 
ttcgtaacta attacatatc gctgagtgtt ctgtcacccc acttgtgttt gagtatgatt 2580 
cttcttgaaa tacagaaaat ctaaacagcc ttgtgtcttc atgtcgttgt tctgcctttc 2640 
acaaatgatg aaagtgaaga tgttctttcc aagttaatac tgaagtggtt aactttgcaa 2700 
ctggactgtc ttttacttta tgtacaggat atctaattta tttctgtcag ctggtgttgc 2760 
tgacactttc cttatggtgg gggggataca aaagtattta ttatgattct cttctgtttg 2820 
acttttcatt tttatatcat gtttatgtcc taagagaaga tattttttaa atgccttatg 2880 
gataggaatt agtactaagc aatgtgtctt gcatttgttt tcaataggaa aacaaagttc 2940 
tagataggtt aaaagaaaaa caccataaaa gtggtagtat atttaagtgg aaaattatgt 3000 
attttaaagt gtgtaactat taaaaatata ttgcctggaa aaggaatact actgttttgt 3060 
tttgttaaga ccacaaactg tattgctaag gacattttaa gagtctgctt agccataaca 3120
ctcatttaac agcatgggta gtaagcatgg tttcttgcag aaaacacaaa gttctcagct 3180
aacacattaa acttctcact ttatatactt tttttcagag aaatttatct ttagtttttc 3240
ttattctgtg aaaatatttg ttcttatgtc aaagaattat ttttcagata tattggtaaa 3300
gatggactga tattaaaagt catatttatt cttttacccc cttactgtgt tttgtgattt 3360
tcatttttgg cccaattatt acatatcagg ttttcctgat ttatatttac atgtttgaaa 3420
agttcattat atattttagt agcgttacat atattataaa tttttgtaac atttaaatgt 3480
tcacttatgt ggcattttct gtttctcatt ggtaataagc atttctaata taagcaagta 3540
aaaatacaaa tttaattact tagaaatgat aactaagatc acaatgactg tgtacccttt 3600
aaaatatctt ttaatttgag gaattacatt ctatctaaat tgtgtgtctg tttttgaaag 3660
gtgtgtgctt tcaagtaggt ggtttttatt taaagtactt atttttttga ggttaaaata 3720
ttcagaaact acttttatgc attgtagcag taatcattta tggaatgctt ttgacatgtt 3780
ttaacagggt aaccaaaact tttaggcctg ttttttttgg ttgtccttgt tgttatattt 3840
ttaatggaaa tctgttactc aggctggtag cattaaatta gctttttacc ctcaactcaa 3900
gtagcaaata tcatggagaa cttcagtgtt tagcaggctg ttaatttccc atctatgaat 3960
agcactttat gatgtaggct aaactaagca aataatgcag cgttatgtgg atcattatgc 4020
aataaagctg ctaaagtcac atttgtggtt gatactcaag ccttgtacaa gcacatatca 4080
tgtagacatt tttaaatgtg aagctttaac aggaatagga tagctgtttc aaaccttgtg 4140
caattgtaaa taaattgttt agctgtgttt ctttcatttt taataaaaaa ttaaaaaaaa 4200
aaaaaa 4206



<210> 107 

<211> 1971 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7507432CB1 



<400> 107 

gggcgcgagg gcgcgcctga gccgccatgt cggaggcgcg tggagagcca gggtccgggc 60 
ctgaggctgg cgcccgattc ttctgcactg cgggtcgcgg cctggagccg ttcgtaatgc 120 
gagaggtgcg ggcgcggctg gcggccacgc aggttgaata tatttcagga aaggtttttt 180 
tcaccacctg ttctgatttg aatatgttga agaaattaaa atctgcagaa agattatttt 240 
tgctgattaa aaagcagttt ccacttatta tttcttctgt aagtaaagga aaaatattta 300 
atgaaatgca aagacttata aatgaagatc caggaagttg gttgaatgcc atttcaattt 360 
ggaaaaatct tcttgaactt gatgcaaaaa aggaaaaact ttctcagaga gatgataacc 420 
aactaaaaag aaaagtggga gaaaatgaaa tcattgcaaa gaaattaaaa atagaacaaa 480 
tgcaaaagat agaagagaat agggactgcc agctggaaaa acaaataaaa gaagaaactc 540 
tggagcaaag agattttacc actaaaagcg aaaagtttca agaagaagaa tttcagaatg 600 
acatagagaa agcaattgat actcataatc agaatgactt gactttcaga gtatcttgtc 660 
gctgcagtgg aactattgga aaggccttca ctgcacagga ggtaggaaaa gtaattggaa 720 
ttgctattat gaaacacttt ggatggaaag cagacttgag gaatccacaa ttagagatct 780 
ttatacatct aaatgacatt tactctgtgg tggggattcc tgtgttcagg gtttccctag 840 
ccagcagagc ttacatcaag acagctggac tgcgatctac aatagcgtgg gcaatggcat 900 
ctctggctga cattaaggct ggtgcatttg ttttagatcc aatgtgtgga cttggaacaa 960 
tacttttgga agctgctaaa gaatggccag atgtgtatta tgtaggtgct gatgtcagcg 1020 
actcacagtt actaggtact tgggacaatc tgaaagctgc aggccttgag gataaaattg 1080 
aattacttaa aatctctgtt atagaattgc cattgccttc agaaagtgtt gatattatta 1140 
tttctgacat tccatttggg aaaaagttta agttaggaaa agacatcaaa agcattctac 1200 
aagaaatgga aagagtgctt catgttggcg gaaccattgt attgttgctt agtgaagatc 1260 
accacaggcg ccttacagat tgtaaagaga gcaacatccc tttcaattcc aaggacagtc 1320 
acacagatga acctggaatt aaaaagtgct tgaatcctga agaaaaaact ggtgcattca 1380 
agacagcgtc aacttcattc gaagccagta accacaaatt cttagacaga atgtcaccat 1440 
ttggctcctt ggtaccagtg gaatgctaca aagttagcct tggaaagaca gatgcgttca 1500 
tatgtaaata taagaagtcg cactcttctg gactgtagca ggcttgctgc catcagccgg 1560
gttcaggccc ttgtaagtta gctgtacagc agaagtttgc tgtctctagg attcatatag 1620
cacccacaga gctccaagta accaaattcc cccaaagaca ggaggtgtgg ctgatgggta 1680
cacatgcttg gtaagagttt ctgctttata ttctaaagta cttaatgttt ttgagaaaaa 1740
agttaccgaa tattttttag acttaagttt tatcttctgt cagaatgtga aaagttctgc 1800
ccagtttctg agcaaaaact tcaatttcaa aataaaggtt ttcaggggtt cacttaagtg 1860
tatcacaatc aacaccagac gggatatttt aactatcaga ttttcggaaa tgtaaaaacc 1920
cttacgtgta acttcttttg atattaaact tgatttttga gcctgtaaaa a 1971



<210> 108 

<211> 1365 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 5639578CB1 



<400> 108 

acgttgacgc tacctacctc tggcctggcc tctacccact gctgccctgg ccgctaccaa 60 
catgagcctc ccggagagtc ctcacagccc cgctactctc gactatgctc tggaagaccc 120 
gcaccagggc cagaggtccc gagaaaagag caaggcgaca gaggtaatgg cagatatgtt 180 
tgatggccgc ttggagccaa tcgtgttccc accgccccgg cttccagagg agggggtcgc 240 
gccccaggat cccgcagatg gtggccatac tttccacatc ttggtcgatg cgggtcgcag 300 
tcatggagcc atcaaagcgg ggcaggaagt gactccacca cccgcggagg gcctagaagc 360 
ggcctctgcc tcgctgacaa ctgacggcag cctcaaaaat ggctttccgg gtgaagagac 420 
gcacggccta ggtggggaga aggctctaga aacctgtggg gcagggaggt cggagtctga 480 
agtgattgca gaggggaagg ccgaggacgt gaagcctgag gagtgtgcca tgttctcagc 540 
cccagtggat gagaagccag gaggcgaaga aatggacgtg gcggaagaaa acagagcaat 600 
agatgaggta aacagggagg cagggcctgg gcccgggccc gggcccctga acgtgggtct 660 
ccacctgaac cccctggagt ccatccagct ggaactggac tccgtgaatg cagaggctga 720 
cagggcgctc ctgcaggtgg agcgcaggtt tgggcagatt catgaatact acctggagca 780 
gaggaatgac atcattcgca atatcccggg cttctgggtc actgctttcc gccaccaccc 840 
acagctgtct gccatgatta gaggccaaga tgccgagatg ttaagctact taaccaattt 900 
ggaagtgaag gagctcagac accctaggac aggctgcaag tttaagttct tctttcaaag 960 
aaacccttac ttcagaaaca agctgattgt aaaggtgtat gaggtcagat ccttcggcca 1020 
agtggtgtct ttttccactc taatcatgtg gcgccggggc catggacccc agtccttcat 1080 
tcataggaac cgacatgtca tctgcagctt cttcacctgg ttttcagacc acagccttcc 1140 
agagtccgac aggattgctc agattattaa agaggacctc tggtcaaatc cactgcagta 1200 
ctacctgttg ggtgaagacg cccatagagc tagacgtcgc ctggtaaggg agccagtgga 1260 
gatccccagg ccctttgggt tccagtgtgg ttaacctttg cctttggaat tactcctgca 1320 
gaaggttgcc tgccaccacc agctggacct gagcttgggc aacag 1365 



<210> 109 

<211> 5261 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7509080CB1 



<400> 109 

ggaggtcaag atggtggctg taaaactagg atccctgacg attgcttagc attaaggccc 60
gacatggaac cggggtgtga cgagttcctg ccgccaccgg agtgcccggt ttttgagcct 120
agctgggctg aattccaaga cccgcttggc tacattgcga aaataaggcc catagcagag 180
aagtctggca tctgcaaaat ccgcccaccc gcggattggc agcctccttt tgcagtagaa 240
gttgacaatt tcagatttac tcctcgcgtc caaaggctaa atgaactgga ggcccaaact 300
agagtgaaat tgaactattt ggatcagatt gcaaaattct gggaaattca aggctcctct 360
ttaaagattc ccaatgtgga gcggaagatc ttggacctct acagccttag taagcaatgt 420
aacacacacc cgtttgacaa tgaggtaaaa gataaggaat acaagcccca cagcatcccc 480
cttagacagt ctgtgcagcc ttcaaagttc agcagctaca gtcgacgggc aaaaaggcta 540
cagcctgatc cagagcctac agaggaggac attgagaagc atccagagct aaagaagtta 600
cagatatatg ggccaggtcc caaaatgatg ggcttgggcc ttatggctaa ggataaggat 660
aagactgtgc ataagaaagt cacatgcccc ccaactgtta cggtgaagga tgagcaaagt 720
ggaggtggga acgtgtcatc aacattgctc aagcagcact tgagcctaga gccctgcact 780
aagacaacca tgcaacttcg aaagaatcac agcagtgccc agtttattga ctcatatatt 840
tgccaagtat gctcccgtgg ggatgaagat gataagcttc ttttctgtga tggctgtgat 900
gacaattacc acatcttctg cttgttacca ccccttcctg aaatccccag aggcatctgg 960
aggtgcccaa aatgtatctt ggcggagtgt aaacagcctc ctgaagcttt tggatttgaa 1020
caggctaccc aggagtacag tttgcagagt tttggtgaaa tggctgattc cttcaagtcc 1080
gactacttca acatgcctgt acatatggtg cctacagaac ttgtagagaa ggaattctgg 1140
aggctggtga gcagcattga ggaagacgtg acagttgaat atggagctga tattcattcc 1200
aaagaatttg gcagtggctt tcctgtcagc aatagcaaac aaaacttatc tcctgaggag 1260
aaggagtatg cgaccagtgg ttggaacctg aatgtgatgc cagtgctaga tcagtctgtt 1320
ctctgtcaca tcaatgcaga catctcaggc atgaaggtgc cctggctgta cgtgggcatg 1380
gttttctcag cattttgttg gcatattgag gatcactgga gttactctat taactatctg 1440
cattggggtg agccgaagac ctggtatggt gtaccctccc tggcagcaga gcatttggag 1500
gaggtgatga agatgctgac acctgagctg tttgatagcc agcctgatct cctacaccag 1560
cttgtcactc tcatgaatcc caacactttg atgtcccatg gtgtgccagt tgtccgcaca 1620
aaccagtgtg caggggagtt tgtcatcact tttcctcgtg cttaccacag tggttttaac 1680
caaggctaca attttgctga agctgtcaac ttttgtactg ctgactggct acctgctgga 1740
cgccagtgca ttgaacacta ccgccggctc cggcgctatt gtgtcttctc ccacgaggag 1800
ctcatctgca agatggctgc cttcccagag acgttggatc tcaatctagc agtagctgtg 1860
cacaaggaga tgttcattat ggttcaggag gagcgacgtc tacgaaaggc ccttttggag 1920
aagggcgtca cggaggctga gcgagaggct tttgagctgc tcccagatga tgaacgccag 1980
tgcatcaagt gcaagaccac gtgcttcttg tcagccctgg cctgctacga ctgcccagat 2040
ggccttgtat gcctttccca catcaatgac ctctgcaagt gctctagtag ccgacagtac 2100
ctccggtatc ggtacacctt ggatgagctc cccaccatgc tgcataaact gaagattcgg 2160
gctgagtctt ttgacacctg ggccaacaaa gtgcgagtgg ccttggaggt ggaggatggc 2220
cgtaaacgca gctttgaaga gctaagggca ctggagtctg aggctcgtga gaggaggttt 2280
cctaatagtg agctgcttca gcgactgaag aactgcctga gtgaggtgga ggcttgtatt 2340
gctcaagtcc tggggctggt cagtggtcag gtggccagga tggacactcc acagctgacc 2400
ttgactgaac tccgggtcct tcttgagcag atgggcagcc tgccctgtgc catgcatcag 2460
attggggatg tcaaggatgt cctggaacag gtggaggcct atcaagctga ggctcgtgag 2520
gctctggcca cactgccctc tagtccaggg ctattgcggt ccctgttgga gagggggcag 2580
cagctgggtg tagaggtgcc tgaagcccat cagcttcagc agcaggtgga gcaggcgcaa 2640
tggctagatg aagtgaagca ggccctggcc ccttctgctc acaggggctc tctggtcatc 2700
atgcaggggc ttttggttat gggtgccaag atagcctcca gcccttctgt ggacaaggcc 2760
cgggctgagc tgcaagaact actgaccatt gcagagcgct gggaagaaaa ggctcatttc 2820
tgcctggagg ccaggcagaa gcatccacca gccacattgg aagccataat tcgtgagaca 2880
gaaaacatcc ctgttcacct gcctaacatc caggctctca aagaagctct gactaaggca 2940
caagcttgga ttgctgatgt ggatgagatc caaaatggtg accactaccc ctgtctagat 3000
gacttggagg gcctggtggc tgtgggccgg gacctgcctg tggggctgga agagctgaga 3060
cagctagagc tgcaggtatt gacagcacat tcctggagag agaaggcctc caagaccttt 3120
ctcaagaaga attcttgcta cacactgctt gaggtgaggt ctgagaccct gacccacagc 3180
ctcttcttca tctggcctgg ctgctgtgag atggcgcata taatgagaac atagattttt 3240
ttagtgggca cctgggtagg aaggagaggg tgtagttggt gagggaagcc tggtcatttc 3300
ctgtatgtct gcctgcctgc ctcaggtgct ttgcccgtgt gcagacgctg gctcagacag 3360
caccaagcgt agccggtgga tggagaaggc gctggggttg taccagtgtg acacagagct 3420
gctggggctg tctgcacagg acctcagaga cccaggctct gtgattgtgg ccttcaagga 3480
aggggaacag aaggagaagg agggtatcct gcagctgcgt cgcaccaact cagccaagcc 3540
cagtccactg gcaccatccc tcatggcctc ttctccgact tctatctgtg tgtgtgggca 3600
ggtgccagct ggggtgggag ttctgcagtg tgacctgtgt caggactggt tccatgggca 3660 
gtgtgtgtca gtgccccatc tcctcacctc tccaaagccc agtctcactt catctccact 3720 
gctagcctgg tgggaatggg acacaaaatt cctgtgtcca ctgtgtatgc gctcacgacg 3780 
gccacgccta gagacaatcc tagccttgct ggttgccctg cagaggctgc ccgtgcggct 3840 
gcctgagggt gaggcccttc agtgtctcac agagagggcc attggctggc aagaccgtgc 3900 
cagaaaggct ctggcctctg aagatgtgac tgctctgttg cgacagctgg ctgagcttcg 3960 
ccaacagcta caggccaaac ccagaccaga ggaggcctca gtctacactt cagccactgc 4020 
ctgtgaccct atcagagaag gcagtggcaa caatatttct aaggtccaag ggctgctgga 4080 
gaatggagac agtgtgacca gtcctgagaa catggctcca ggaaagggct ctgacctgga 4140 
gctactgtcc tcactgttgc cgcagttgac tggccctgtg ttggagctgc ctgaggcaat 4200 
ccgggctccc ctggaggagc tcatgatgga aggggacctg cttgaggtga ccctggatga 4260 
gaaccacagc atctggcagc tgctgcaggc tggacagcct ccagacctgg acagaattcg 4320 
cacacttctg gagctggaaa aatttgaaca tcaagggagt cggacaagga gccgggctct 4380 
ggagaggcga cggcggcggc agaaggtgga tcagggtaga aacgttgaga atcttgttca 4440 
acaggagctt cagtcaaaaa gggctcggag ctcagggatt atgtctcagg tgggccgaga 4500 
agaagaacat tatcaggaga aagcagaccg tgaaaatatg ttcctgacac cttccacaga 4560 
ccacagccct ttcttgaaag gaaaccaaaa tagcttacaa cacaaggatt caggctcttc 4620 
agctgcttgt ccttctttaa tgcctttgct acaactctcc tactctgatg agcaacagtt 4680 
gtgacagtgg caccaaaggt catttgtggt tgtttttgtt tgtttgtttc ttaaatccta 4740 
ctatctcctg gcctggacct cagaaggagc tttttgccta tctataattt ttcactgcca 4800 
atttttgata tcctctctcc tagagttact gttaaaaggt tggttcgtaa agtccacacc 4860 
ccgatgctca gaagtgtctt gccagcaaca ttcctgctag catacaggag tgatttccta 4920 
aaccagtttc attctagtct gaatagggac aaacaaatct tgaggaagcc caagtgcgta 4980 
cctttatttt tgcccccacc accctctttc tgtacttcaa tttttgtttg ttttttgttt 5040 
ttttgtccct gtcataaaat attttggtgc ttcaaaactt gtaccttcat tgtacatcct 5100 
tttcttttct ccccttgggt cttattataa aagaagacaa tgtacgttgt aattaccaaa 5160 
aagaataggg aaaaacaaga atttcatgac tctacctgtg gtctatcttt aatttcattt 5220 
cttttgttaa aaataaaaca atgagtatgt ttggatacta a 5261 

<210> 110 
<211> 814 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505899CB1 

<400> 110 

gctagttcta gatcgcgagg tccgcccttt tttttttttt tcctccctgt cgccaccgag 60 

gtcgcacgcg tgagacttct ccgccgcctc cgccgcagac gccgccgcga tgcgctacgt 120 

cgcctcctac ctgctggctg ccctaggggg caactcctcc cccagcgcca aggacatcaa 180 

gaagatcttg gacagcgtta tcagtgagct gaatggaaaa aacattgaag acgtcattgc 240 

ccagggtatt ggcaagcttg ccagtgtacc tgctggtggg gctgtagccg tctctgctgc 300 

cccaggctct gcagcccctg ctgctggttc tgcccctgct gcagcagagg agaagaaaga 360 

tgagaagaag gaggagtctg aagagtcaga tgatgacatg ggatttggcc tttttgatta 420 

aattcctgct cccctgcaaa taaagccttt ttacacatct aaaaaaaaaa aaaaaaagcg 480 

gccgcttttt tttttttttt gagatgtgta aaaaggcttt atttgcaggg gagcaggaat 540 

ttaatcaaaa aggccaaatc ccatgtcatc atctgactct tcagactcct ccttcttctc 600 

atctttcttc tcctctgctg cagcaggggc agaaccagca gcaggggctg cagagcctgg 660 

ggcagcagag acggctacag ccccaccagc aggtacactg gcaagcttgc caataccctg 720 

ggcaatgacg tcttcaatgt tttttccatt cagctcactg ataacgcggc ggcgtctgcg 780 

gcggagaagt ctcacgcgtg cgacctcggt ggcg 814 

<210> 111 
<211> 2637
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7505904CB1 

<400> 111 

aggctctcta ccggtgaggg tttgcgggga agatggagta tcccgcgccg gccacggtgc 60 
aggccgcgga cggcggagcg gccgggcctt acagcagctc ggagttgctg gagggccagg 120 
agccggacgg ggtgcgcttt gaccgcgaga gggcgcgccg cctgtgggaa gccgtgtccg 180 
gtgcccagcc ggtgggtaga gaggaagtgg agcacatgat ccagaagaac caatgtctct 240 
tcaccaacac ccagtgtaag gtttgctgcg ccttgcttat ttctgagtcc cagaagctgg 300 
cacattacca gagcaaaaaa catgccaaca aagtgaagag atacctagca atccatggaa 360 
tggagacatt aaagggggaa acgaagaagc tagactcaga tcagaagagc agcagaagca 420 
aagacaagaa ccagtgctgc cccatctgta acatgacctt ttcctcccct gtcgtggccc 480 
agtcgcacta cctggggaag acccacgcaa agaacttaaa gctgaagcag cagtccacta 540 
aggtggaagc tggaaagggc tacccctgca aaacatgtaa gatagtgctg aactccatag 600 
aacagtacca agctcatgtc agcggcttca aacacaagaa ccagtcacca aaaacagtgg 660 
catcatccct gggccagatt ccaatgcaaa ggcaacccat tcagaaagac tcaaccacct 720 
tggaagacta gaggtgattc tgcccagcat cccatattgg gccagccatg agccagcttc 780 
ccgtgactgc tcagcccttg gctccctctt gctcgttgtt ctcaccagga aagtacacgg 840 
gcctgaggca ggattgggcc acagacagcc tctcattggt ccgggctaat tcactcctgc 900 
tgctcccctt tggcaggggt cctgtaggtc atgacagggg aggcaagggt attgagagac 960 
tcggggtctc gcggggtggt agtttggagg gtggctttcc ccatttccca acccctctgg 1020 
gccttaggtg ctgaggcccc tgccacctgt ctttcctcta aaggtcagtt ttgggccagt 1080 
tcttgcaact aaagagcaga gatctctctg ggccctagac atttccagca aaacctggaa 1140 
ctttcatgcc aaacctgggg cagggcagga aacagaggaa atggctgcaa catgggagct 1200 
tggagctaat acgacactct gccttccccc agaaggtgca ggctttcctg agtcttagac 1260 
cagatatggc cagttgcgca ggtttctgcc aactgtgaag tatcctcctg gagcagtgac 1320 
acaatcttgg cggagcattg ctacccccgc tgccccctcc acagttcctg aatggtgcta 1380 
aggatctgca gcagttggca acgagctggg gctgggggcg gcctccatgt ccactgagat 1440 
cataggacac tccaatgggg atgggacctt tcccctcctc catcagaggt gctctgaccc 1500 
taggttacac gggaaagtgc cccacatgca agtctccctg agggttctgc ccctaaaggc 1560 
agactgcctc atgcccgcta gctgtgaggt tcattgctac cctcggccct actagccctc 1620 
tcttccccct tgtgcagcgg accacttgcc cagtttgctg tggtgctagc cttccccatc 1680 
atccaccggg tgatttctgg gtcccaggga aagaaagaga gagctgatgc aggtttctac 1740 
agtgaggaac aggcgtttcc caggccccac acccagattt ctctatcttt gctgtgtttt 1800 
atggcctggg actgagtcca cacggataga tttttccttg taaccttgag acgagaattc 1860 
caaggagtgt caccatcaga ggcttctctt cattgtgtca aagaagcccc tagctgctct 1920 
cgtggcctcc ttcccccact ccctatccct tcacctgtga aatgcctttg ctttgcatat 1980 
tgtgtgtgga tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtatgtgtg tgcttctgtg 2040 
tgtgcctaat gctctgtctc ttggtcactg aagcatccaa ataaagaatt tccctcatgg 2100 
gccagactac aacttaacag acctagggtt ttgagccata ttgccattgg atctagtatg 2160 
tctgtcttga cccctcaccc ccaccccatt cagacctcag tagtccccag tagtcacttt 2220 
aggagttggg acagagtcga gcaccctgtg aaagtctccc ttgagttcag gtgccctgaa 2280 
ctcagtacag cctcatcttc tcatccacca ccttctcctg cttctcctgt cacatagggc 2340 
ctcatgaagg tctttaaagg gtttaagctg gggatctgct ttatgtgtga gagaactgtt 2400 
ctggctgtta tgtaaaggag tgcagggagg ggcaggaaag gagacaagga ggccagtatg 2460 
gacactgtgt ccagtatggc tttccaggct aatcacttag ggcagagacc cagccttatt 2520 
catctctact tcctgatgtc catagtggtg tgtcctgaat tcttattgaa tgaccaatga 2580 
cgtgtctcag gagaaatggt tctcctgccc agtctaaccg gatagtttgt ccggggc 2637 

<210> 112 
<211> 2699 
<212> DNA

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 7509224CB1 

<400> 112 

ccgctttccc ggctccgtcg ctgacgcgtc gtagacgttg gggagcggga aggcaacggc 60
agcgggatcg ggatgaacag cggcggcggc ttcggtttgg gcttaggctt cggcctcacc 120
cccacgtcgg tgattcaggt gacgaatctg tcgtcggcgg tgaccagcga gcagatgcgg 180
acgctttttt ccttcctagg agaaatcgag gagctgcggc tctacccccc ggacaacgca 240
cctcttgctt tttcctccaa agtatgttat gttaagtttc gtgatccatc aagtgttggc 300
gtggcccagc atctaactaa cacggttttt attgacagag ctctgatagt tgttccttgt 360
gcagaagtga tgcagccaac acgaacggtt gtcatgtgta acacaacttt cgatcaccgc 420
gaaaacaccg tcctgggaaa gcgtccatgc ttgatatcgt ttggttcatg aacattaagt 480
ttccagtaca ggtaaaatcc cagaggaatc caaagccctc tctttattgg ctcctgctcc 540
aaccatgaca agtctgatgc ctggtgcagg attgcttcca ataccgaccc caaatccttt 600
gactactctt ggtgtttcac ttagcagttt gggagctata ccagcagcag cactagaccc 660
caacattgca acacttggag agataccaca gccaccactt atgggaaacg tggatccttc 720
caaaatagat gaaattagga gaacggttta tgttggaaat ctgaattccc agacaacgac 780
agctgatcaa ctacttgaat tttttaaaca agttggagaa gtgaagtttg tgcggatggc 840
aggtgatgag actcagccaa ctcggtttgc ttttgtggaa tttgcagacc aaaattctgt 900
accaagggcc cttgctttta atggagttat gtttggagac aggccactga aaataaatca 960
ctccaacaat gcaatagtaa aaccccctga gatgacacct caggctgcag ctaaggagtt 1020
agaagaagta atgaagcgag tacgagaagc tcagtcattt atctcagcag ctattgaacc 1080
agagtctgga aagagcaatg aaagaaaagg cggtcgatct cgttcccata ctcgctcaaa 1140
atccaggtct agctcaaaat cccattctag aaggaaaaga tcacaatcaa aacacaggag 1200
tagatcccat aatagatcac gttcaagaca gaaagacaga cgtagatcta agagcccaca 1260
taaaaaacgc tctaaatcaa gggagagacg gaagtcaagg agtcgttcgc attcacggga 1320
caagagaaaa gacactcgag aaaagatcaa ggaaaaggaa agagtgaaag agaaagacag 1380
ggaaaaggag agagagaggg aaaaggaacg tgaaaaagaa aaggaacggg gtaaaaacaa 1440
agaccgggac aaggaacggg aaaaggaccg ggaaaaagac aaggaaaagg acagagagag 1500
agaacgggaa aaagagcatg agaaggatcg agacaaagag aaggaaaagg aacaggacaa 1560
agaaaaggaa cgagaaaaag acagatccaa agagatagat gaaaaaagaa agaaggataa 1620
aaaatccaga acaccaccca ggagttacaa tgcatcgcga agatctcgta gttccagcag 1680
ggaaaggcgt aggaggagga gcaggagttc ttccagatcg ccaagaacat caaaaaccat 1740
aaaaaggaaa tcttctagat ctccgtcccc caggagcaga aataagaagg ataaaaagag 1800
agaaaaagaa agggaccaca tcagtgaaag aagagagaga gaacgttcaa cgtctatgag 1860
aaagagttct aatgatagag atgggaagga gaagttggag aagaacagta cttcacttaa 1920
agagaaagag cacaataaag aaccagattc aagtgtgagc aaagaagtag atgacaagga 1980
tgcaccaagg actgaggaaa acaaaataca gcacaatggg aattgtcagc tgaatgaaga 2040
aaacctctct accaaaacaa aagcagtata ggaccgacaa gtgtacctct gcactcaatg 2100
ctggaatcaa atccaaagct tttaattctc tcaacaagat gtaaacagga aagaaatcta 2160
gttgagcatg aagataggat ctaacagctt ttccagttgt tagatgactt tgtggccatc 2220
ttgttattga gtaagaaaat aaagcatgga catcatgaaa ataacagatg ttacccaaac 2280
tcatcttcta aaatctgtgc atttccatgg tggctgacac acttgtcatg tggtctgtta 2340
gtgtttgcca agaaccattg caaataaatt gaacatcaaa gatccaagtt tgtactatcc 2400
ctaaagactg gagataagca ttggaggctc ttttaaaaaa tgctagttac tgaatttcgt 2460
atcgtttcta cttcttcttc ttattttcca atatatacag tttgatggat gtcgcttgga 2520
aattggtgca atatatacac acccttgtta gttgccaagt atgtaagaaa gttttaacat 2580
ttacttccca gggacttgcg cactggcgct caattccccc tattgggtct tctttcgccc 2640
ctgtcgacgc aactctcctc taaaaaagtt gccataaatg ccaaaaaggc gtaaaattc 2699

<210> 113
<211> 1569
<212> DNA

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7505922CB1 

<220> 

<221> unsure 

<222> (1)...(1569)

<223> a, t, c, g, or other 



<400> 113 

gaggacgaag ccagttctct ttttctggtc tgactggctt ggaaattccc cgagcctgan 60
cccgccccan agaaatcccc anccagcgtt tatagggcgc cgcngcggcg ctgcagagcc 120
cacagcagtc cgtgccgccg tcccgcccgc cagcgcccca gcgaggaagc agcgcgcagc 180
ccgcggccca gcgcacccgc agcagcgccc gcagctcgtc cgcgccatgt tccaggcggc 240
cgagcgcccc caggagtggg ccatggaggg cccccgcgac gggctgaaga aggagcggct 300
actggacgac cgccacgaca gcggcctgga ctccatgaaa gacgaggagt acgagcagat 360
ggtcaaggag ctgcaggaga tccgcctcga gccgcaggag gtgccgcgcg gctcggagcc 420
ctggaagcag cagctcaccg aggacgggga ctcgttcctg cacttggcca tcatccatga 480
agaaaaggca ctgaccatgg aagtgatccg ccaggtgaag ggagacctgg ccttcctcaa 540
cttccagaac aacctgcagc agactccact ccacttggct gtgatcacca accagccaga 600
aattgctgag gcacttctgg gagctggctg tgatcctgag ctccgagact ttcgaggaaa 660
taccccccta caccttgcct gtgagcaggg ctgcctggcc agcgtgggag tcctgactca 720
gtcctgcacc accccgcacc tccactccat cctgaaggct accaactaca atggccacac 780
gtgtctacac ttagcctcta tccatggcta cctgggcatc gtggagcttt tggtgtcctt 840
gggtgctgat gtcaatgctc agctcacctg gggccgccca agcacccgga tacagcagca 900
gctgggccag ctgacactag aaaaccttca gatgctgcca gagagtgagg atgaggagag 960
ctatgacaca gagtcagagt tcacggagtt cacagaggac gagctgccct atgatgactg 1020
tgtgtttgga ggccagcgtc tgacgttatg agcgcaaagg ggctgaaaga acatggactt 1080
gtatatttgt acaaaaaaaa agttttattt ttctaaaaaa agaaaaaaga agaaaaaatt 1140
taaagggtgt acttatatcc acactgcaca ctgcctggcc caaaacgtct tattgtggta 1200
ggatcagccc tcattttgtt gcttttgtga actttttgta ggggacgaga aagatcattg 1260
aaattctgag aaaacttctt ttaaacctca cctttgtggg gtttttggag aaggttatca 1320
aaaatttcat ggaaggacca cattttatat ttattgtgct tcgagtgact gaccccagtg 1380
gtatcctgtg acatgtaaca gccaggagtg ttaagcgttc agtgatgtgg ggtgaaaagt 1440
tactacctgt caaggtttgt gttaccctcc tgtaaatggt gtacataatg tattgttggt 1500
aattattttg gtacttttat gatgtatatt tattaaacag atttttacaa aaaaaaaaaa 1560
aaaaaaaaa 1569



<210> 114 
<211> 2852 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7507695CB1 



<400> 114 

gaagtaccgg tcagccagcc tcctgctgca gagccagtga actcaggtcg ggcttctcag 60
ctgcgcacat aggcagtact catcttggcc ctgggaagaa actcaagaag aagcttttga 120
aacataaagc ttggatgggg tttgacctct gcagggcagc gcccagctat aggagttccc 180
ctgctgagca gagaagatga ctgcagaatt gagagaagcc atggccctag ccccatgggg 240
cccagtgaag gtgaaaaagg aggaggaaga agaagaaaac ttcccaggtc aggcatccag 300
ccaacaagtg cactccgaga acatcaaagt ctgggcccca gtgcagggtc ttcagacagg 360
ccttgatgga ccagaagagg aagaaaaggg tcagaacata tcctgggata tggcggtagt 420
cctgaaagca actcaggagg cacctgctgc ttcaaccctt ggcagctact cattaccagg 480
gactctggcc aagagtgaga tactggagac tcatgggacc atgaactttc taggtgctga 540
aaccaagaac ctacagttac tggttccaaa aactgagata tgtgaggaag ctgaaaaacc 600
cctcatcata tcagaaagaa tccagaaagc tgatcctcaa ggacctgagt taggagaagc 660
ttgtgaaaag ggaaacatgt taaagaggca gagaataaag agagaaaaga aagatttcag 720
acaagtgata gtgaatgact gtcacttacc tgaaagcttc aaagaagagg aaaaccagaa 780
atgtaagaaa tctggaggaa aatatagcct taattctggc gctgttaaaa atccaaaaac 840
ccagcttgga caaaagcctt ttacgtgtag cgtgtgtggg aaaggattta gtcagagtgc 900
aaacctcgtt gtgcatcagc gaatccacac tggagagaaa ccctttgaat gtcatgagtg 960
tgggaaggcc ttcattcaga gtgcaaacct cgttgtgcat cagagaatcc acactggaca 1020
gaaaccttat gtttgctcaa aatgtgggaa agccttcact cagagttcaa atctgactgt 1080
acatcaaaaa atccactcct tagaaaaaac ttttaagtgc aatgaatgtg agaaagcctt 1140
tagttacagc tcacaacttg ctcggcacca gaaagtccac attacggaaa aatgctatga 1200
atgtaatgaa tgtgggaaaa catttactag gagctcaaac ctcattgtcc accagaggat 1260
ccacactggg gagaagccct ttgcctgtaa cgactgtggc aaagccttta cccagagtgc 1320
aaatcttatt gtacatcagc gaagccatac tggtgagaag ccatatgagt gtaaagagtg 1380
tgggaaagcc tttagttgtt tttcacacct tattgtgcac cagagaattc acactgcaga 1440
gaaaccttac gactgcagcg aatgtgggaa agccttcagt cagctctctt gccttattgt 1500
ccaccagaga attcacagtg gagatcttcc ttacgtgtgt aatgaatgtg ggaaggcctt 1560
cacatgtagc tcatacctac ttattcatca gagaattcat aatggagaaa aaccttacac 1620
atgtaatgag tgtgggaagg ccttcagaca gaggtcgagc ctcaccgtgc accagagaac 1680
ccacactggg gagaagccct atgaatgtga gaagtgtggt gcagctttca tttccaactc 1740
acacctcatg cgataccata gaacccatct tgttgaataa caagtaagga agaggaagac 1800
ctccagcatt ggtcataacc ttctgcctcc ctaatgagac acctctttgc tgttttcttc 1860
ctcctctata aaagtgaggg ctgtgtcctt aaaagttata gttttcagga atgcagcaga 1920
agacacaaga aaagcatttc agaggctaat ttaaaacaaa aagtaagcac ctaaaggcaa 1980
ggactttgtt ttaactgtac cttaagcggt catgttctct gaagtggaat gtggctttcc 2040
taggaatggg tcgtacaaag ctaagtggta atgatgctat ttggggaaag gtcttttttg 2100
cttaattttg ttttttaaaa ctctgatgat tgcttgagca acaggcaggt tatctgcctg 2160
gttgaattct ggttgaaccg tgtattctaa tatttctggt taagtggtga ctgggtaagg 2220
aaaccacttg gggtagcagt tcaacaattc acttacgaat gtttataagc tttccatttc 2280
ctaggtaatt ttttaaaagc cagtcaaaac aaaaacttta ctgaaaatgg acagaaatag 2340
gaaatggact ttttccttac tgtctatacc tcctgaacct tggtattgta aagatctggg 2400
gacctctggg tctgttctga ccattcccta gtctccatgg ccaagcactc aaggattgat 2460
ggacaccaca caccagctat attcatttgc caagatcaac agctccttct ccaaacaact 2520
caagccccca attcccatcg cattccattt gggtgagatg caactaacag ccccttcctg 2580
gatgaaggag tcaaagaata aagcttgcct agaggcatgc cccagttgtc tctgctgtca 2640
ttctgctcaa gtccagccag gggtctcagt ttgggtttca ttctcacatt ctggatggtg 2700
tttagaagaa atagtggagc tcttctcatt ctgagaagac caaagaggga aagtgatggg 2760
agaggggagg tggaaataag attcctaagg tcttaatttg attttttgag aaattcagag 2820
gtattggaat atgaaaaact tttagaagga cg 2852